PESTICIDAL AGENTS

The present invention relates to materials, agents and 
compositions having pesticidal activity which derive from
bacteria, and more particularly from Xenorhabdus species. 
The invention further relates to organisms and methods 
employing such compounds and compositions. 

There is an ongoing requirement for materials, agents,
compositions and organisms having pesticidal activity,
for instance for use in crop protection or
insect-mediated disease control. Novel materials are required
to overcome the problem of resistance to existing
pesticides. Ideally such materials are cheap to produce,
stable, have a high toxicity (either when used alone or
in combination) and are effective when taken orally by
the pest target. Thus any invention which provided
materials, agents, compositions or organisms in which any
of these properties was enhanced would represent a step
forward in the art.

Xenorhabdus spp. in nature are frequently symbiotically 
associated with a nematode host, and it is known that 
this association may be used to control pest activity.
For instance, it is known that certain Xenorhabdus spp. 
alone are capable of killing an insect host when injected 
into the host's hemocoel. 

In addition, one extracellular insecticidal toxin from
Photorhabdus luminescens has been isolated (this species
was recently removed from the genus Xenorhabdus, and is
closely related to the species therein). This toxin is
not effective when ingested, but is highly toxic when
injected into certain insect larvae (see Parasites and
Pathogens of Insects Vol. 2, Eds. Beckage, N. E. et
al., Academic Press 1993).

Also known are certain low-molecular weight heterocyclic
compounds from P. luminescens and X. nematophilus which
have antibiotic properties when applied intravenously or
topically (see Rhodes, S.H. et al., PCT WO 84/01775).

Unfortunately none of these prior art materials have the 
ideal pesticide characteristics discussed above, and in 
particular, they do not have toxic activity when 
administered orally. 

The present invention provides pesticidal agents and 
compositions from Xenorhabdus species, organisms which 
produce such compounds and compositions, and methods 
which employ these agents, compositions and organisms, 
that alleviate some of the problems with the prior art.

According to one aspect of the present invention there is 
disclosed a method of killing or controlling insect pests 
comprising administering cells from Xenorhabdus species 
or pesticidal materials derived or obtainable therefrom,
orally to the pests.

A PCT application of CSIRO published as WO 95/00647 
discloses an apparently toxic protein from Xenorhabdus 
nematophilus; however, no details of the protein's
toxicity are given, and certainly there is no disclosure
of its use as an oral insecticide.

Thus the invention provides an insecticidal composition
adapted for oral administration to an insect, which
composition comprises a pesticidal material obtainable
from a Xenorhabdus species, or a pesticidal fragment
thereof, or a pesticidal variant or derivative of either
of these.

The composition may in fact comprise cells of Xenorhabdus 
or alternatively supernatant taken from cultures of cells 
of Xenorhabdus species. However, the composition 
preferably comprises toxins isolable from Xenorhabdus as
illustrated hereinafter. Toxic activity has been
associated with material encoded by the nucleotide
sequence of Figure 2. Thus, the composition suitably
comprises a pesticidal material which is encoded by all
or part of the nucleotide sequence of Figure 2. 
Pesticidal fragments as well as variants or derivatives 
of such toxins may also be employed. 

The sequence of Figure 2 is of the order of 40 kb in
length. It is believed that this sequence may encode more
than one protein, each of which may be regulatory or
insecticidal either alone or when presented together. It
is a matter of routine to determine which parts are
necessary or sufficient for insecticidal activity.

As used herein the term "variant" refers to toxins which
have modified amino acid sequence but which share similar
activity. Certain amino acids may be replaced with
different amino acids without altering the nature of the
activity in a significant way. The replacement may be
by way of "conservative substitution", where an amino
acid is replaced with an amino acid of broadly similar
properties, or there may be some non-conservative
substitutions. In general however, the variants will be
at least 60% homologous to the native toxin, suitably at
least 70% homologous and more preferably at least 90%
homologous.
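
By way of illustration only, the homology thresholds above
can be checked with a short calculation. The sketch below
assumes that "homologous" is read as percent identity over
two sequences that have already been aligned to equal
length; that reading, the function names and the example
sequences are assumptions made for illustration and do not
come from the sequence of Figure 2.

```python
def percent_identity(native: str, variant: str) -> float:
    """Percent of aligned positions carrying the same residue.

    Assumes both sequences are pre-aligned to the same length,
    with '-' marking gaps (a gap never counts as a match).
    """
    if not native or len(native) != len(variant):
        raise ValueError("sequences must be non-empty and of equal aligned length")
    matches = sum(1 for a, b in zip(native, variant) if a == b and a != "-")
    return 100.0 * matches / len(native)


def homology_band(pct: float) -> str:
    """Map a percent identity onto the bands named in the text."""
    if pct >= 90:
        return "at least 90%"
    if pct >= 70:
        return "at least 70%"
    if pct >= 60:
        return "at least 60%"
    return "below 60%"


# Hypothetical ten-residue peptides differing by one substitution (L -> I)
native_peptide = "MKTLLVAGAL"
variant_peptide = "MKTLIVAGAL"
pct = percent_identity(native_peptide, variant_peptide)
print(pct, homology_band(pct))
```

A real comparison would first align the sequences with a
dedicated alignment tool; the sketch only covers the final
counting step.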

The term "derivative" relates to toxins which have been
modified for example by chemical or biological methods. 

These toxins are novel, and they and the nucleic acids
which encode them form a further aspect of the invention. 

A preferred Xenorhabdus species is the bacterium
X. nematophilus. Particular strains of X. nematophilus
which are useful in the context of the invention include the
ATCC 19061 strain, available from the National Collection
of Industrial and Marine Bacteria, Aberdeen, Scotland
(NCIMB). In addition, suitable strains include two novel
strains of Xenorhabdus which were deposited at the NCIMB
on 10 July 1997 and were designated with repository
numbers NCIMB 40886 and NCIMB 40887. These latter
strains form a further aspect of the invention.

All strains have common characteristics as set out in the 
following Table 1.



Table 1

                                                 Strains
Characteristics               ATCC 19061             NCIMB 40887            NCIMB 40886
Gram stain                    negative               negative               negative
Shape/size                    rods up to 4 µm long   rods up to 4 µm long   rods up to 4 µm long
Motile                        yes                    yes                    yes
Bioluminescent                no                     no                     no
Colour on NBTA*               blue                   blue                   blue
Insecticidal on ingestion
  by insects                  yes                    yes                    yes
Production of antibiotics     yes                    yes                    yes
Resistant to ampicillin
  (50 µg/ml)                  yes                    yes                    yes
Colony morphology/colour      circular, convex,      circular, convex,      circular, convex,
                              cream                  cream                  cream

*NBTA (Oxoid nutrient agar containing 0.0025%
bromothymol blue and 0.004% tetrazolium chloride)

Preferably the pest target is an insect, and more 
preferably it is of the order Lepidoptera, particularly 
Pieris brassicae, Pieris rapae, or Plutella xylostella, or
the order Diptera, particularly Culex quinquefasciatus.

In a preferred embodiment of the invention, cells from 
Xenorhabdus species or agents derived therefrom are used
in conjunction with Bacillus thuringiensis as an oral 
pesticide. 

In further embodiments, rather than using Bacillus 
thuringiensis itself, pesticidal materials obtainable
from B. thuringiensis (e.g. delta endotoxins or other 
isolates) are used in conjunction with Xenorhabdus 
species. 



The term 'obtainable from' is intended to embrace not
only materials which have been isolated directly from the
bacterium in question, but also those which have been
subsequently cloned into and produced by other organisms.

Thus the unexpected discovery that bacteria of the genus
Xenorhabdus (and materials derived therefrom) have
pesticidal activity when ingested, and that such bacteria
and materials can be used advantageously in conjunction
with B. thuringiensis (and toxins or materials derived
therefrom), forms the basis of a further aspect of the
present invention. The pesticidal activity of
B. thuringiensis isolates alone has been well documented.
However, synergistic pesticidal activity between such
isolates and bacteria of the Xenorhabdus species (or
materials derived therefrom) has not previously been
demonstrated.

In still further embodiments of the invention, culture 
supernatant taken from cultures of Xenorhabdus species, 
particularly X. nematophilus, is used in place of cells
from Xenorhabdus species in the methods above.

All of these methods can be employed, inter alia, in pest
control.

The invention also makes available pesticidal
compositions comprising cells from Xenorhabdus species,
preferably X. nematophilus, in combination with B.
thuringiensis. As with the methods above, a pesticidal
toxin from B. thuringiensis (preferably a delta endotoxin)
may be used as an alternative to B. thuringiensis in the
compositions of the present invention.



Likewise, culture supernatant taken from cultures of 
Xenorhabdus species, preferably X. nematophilus, may be
used in place of cells from Xenorhabdus species. 

Such compositions can be employed, inter alia, for crop 
protection e.g. by spraying crops, or for livestock
protection. In addition, compositions of the invention
may be used in vector control.

The invention further encompasses novel pesticidal agents 
which can be isolated from Xenorhabdus spp. Techniques
for isolating such agents would be understood by the 
skilled person. 

In particular, such techniques include the separation and 
identification of toxin proteins either at the protein 
level or at the DNA level. 



The applicants have cloned and partially sequenced a
region of DNA from Xenorhabdus NCIMB 40887 which region
codes for insecticidal activity and this is shown as
Figure 2 (SEQ ID NO. 1) hereinafter. Thus in a preferred
embodiment the invention also provides a toxin which is
encoded by DNA of SEQ ID No. 1 or a variant or fragment
thereof.

The invention also provides a recombinant DNA which
encodes such a toxin. The recombinant DNA of the
invention may comprise the sequence of Figure 2 or a
variant or fragment thereof. Other DNA sequences may
encode similar proteins as a result of the degeneracy of
the genetic code. All such sequences are encompassed by
the invention.
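
The degeneracy referred to above can be illustrated, by way
of example only, with the standard genetic code. The sketch
below builds the standard codon table, shows two different
DNA sequences that translate to the same peptide, and counts
how many DNA sequences encode a given peptide. The short
sequences used are hypothetical and are not taken from the
sequence of Figure 2.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varying fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Number of synonymous codons available for each amino acid
CODONS_PER_AA = Counter(CODON_TABLE.values())


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def count_encodings(peptide: str) -> int:
    """Number of distinct DNA sequences that encode the given peptide."""
    total = 1
    for aa in peptide:
        total *= CODONS_PER_AA[aa]
    return total


# Two hypothetical sequences that differ at the DNA level
# but encode the same tripeptide.
print(translate("ATGAAACTG"), translate("ATGAAGTTA"))
print(count_encodings("MKL"))
```

Because the count grows multiplicatively with peptide
length, even a short toxin fragment can be encoded by a very
large number of distinct DNA sequences.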

The sequence provided herein is sufficient to allow 
probes to be produced which can be used to identify and
subsequently to extract DNA of toxin genes. This DNA may 
then be cloned into vectors and host cells as is 
understood in the art. 

DNA which comprises or hybridises with the sequence of
Figure 2 under stringent conditions forms a further 
aspect of the invention. 

The expression "hybridises with" means that the
nucleotide sequence will anneal to all or part of the
sequence of Figure 2 under stringent hybridisation
conditions, for example those illustrated in "Molecular
Cloning, A Laboratory Manual" by Sambrook, Fritsch and
Maniatis, Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, N.Y.

The length of the sequence used in any particular
analytical technique will depend upon the nature of the
technique, the degree of complementarity of the sequence,
the nature of the sequence and particularly the GC
content of the probe or primer and the particular
hybridisation conditions employed. Under high
stringency, only sequences which are completely
complementary will bind but under low stringency
conditions, sequences which are 60% homologous to the
target sequence, more suitably 80% homologous, will bind.
Both high and low stringency conditions are encompassed
by the term "stringent conditions" used herein.
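
The dependence on GC content mentioned above can be
illustrated with a rough calculation. The sketch below
computes the GC fraction of a short probe and estimates its
melting temperature with the Wallace rule, Tm = 2(A+T) +
4(G+C) in degrees Celsius, which is a rule of thumb for
short oligonucleotides only. The rule is not part of the
present disclosure, and the probe shown is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of bases in the sequence that are G or C."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq: str) -> int:
    """Approximate melting temperature (deg C) of a short oligonucleotide.

    Wallace rule: 2 deg C per A/T base plus 4 deg C per G/C base.
    Only a rough guide, and only for short probes.
    """
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


# Hypothetical 14-base probe, not derived from the sequence of Figure 2
probe = "ATGCGCGCATTAGC"
print(round(gc_content(probe), 2), wallace_tm(probe))
```

A probe with a higher GC fraction gives a higher estimate,
which is why the hybridisation conditions chosen in practice
depend on probe composition as well as length.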

Suitable fragments of the DNA of Figure 2, i.e. those
which encode pesticidal agents, may be identified using
standard techniques. For example, transposon
mutagenesis techniques may be used, for example as
described by H.S. Siefert et al., Proc. Natl. Acad. Sci.
USA (1986) 83, 735-739. Vectors such as the cosmid
cHRIMI can be mutated using a variety of transposons and
then screened for loss of insecticidal activity. In this
way regions of DNA encoding proteins responsible for
toxic activity can be identified.

For example, the mini-transposon mTn3(HIS3) can be
introduced into a toxic Xenorhabdus clone such as cHRIMI,
hereinafter referred to as 'clone 1', by electroporating
cHRIMI DNA into E. coli RDP146 (pLB101) and mating this
strain with E. coli RDP146 (pOX38), followed by E. coli
NS2114Sm. The final strain will contain cHRIMI DNA with a
single insertion of the transposon mTn3(HIS3). These
colonies can be cultured and tested for insecticidal
activity as described in Example 8 hereinafter.
Restriction mapping or DNA sequencing can be used to
identify the insertion point of mTn3(HIS3) and hence the
regions of DNA involved in toxicity. Similar approaches
can be used with other transposons such as Tn5 and mTn£.

Site-directed mutagenesis of cHRIMI as outlined in
"Molecular Cloning, A Laboratory Manual" by Maniatis,
Fritsch and Sambrook, (1982) Cold Spring Harbor, can also
be used to test the importance of specific regions of DNA
for toxic activity.

Alternatively, subcloning techniques can be used to
identify regions of the cloned DNA which code for
insecticidal activity. In this method, specific smaller
fragments of the DNA are subcloned and the activity
determined. To do this, cosmid DNA can be cut with a
suitable restriction enzyme and ligated into a compatible
restriction site on a plasmid vector, such as pUC19.
The ligation mix can be transformed into E. coli and
transformed clones selected using a selection marker such
as antibiotic resistance, which is coded for on the
plasmid vector. Details of these techniques are
described for example in Maniatis et al., supra (see
p390-391) and Methods in Molecular Biology, by L.G.
Davies, M.D. Dibner and J.F. Battey, Elsevier (see
p222-224).

Individual colonies containing specific cloned fragments
can be cultured and tested for activity as described in
Example 8 hereinafter. Subclones with insecticidal
activity can be further truncated using the same
methodology to further identify regions of the DNA coding
for activity.
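
The first step of the subcloning approach, cutting the
cloned DNA at restriction sites to obtain smaller fragments,
can be sketched in silico as follows. The example assumes a
linear sequence and uses the recognition site GATC, cut
immediately before the G, as is the case for Sau3A. The
sequence shown is a short hypothetical string and the
function name is illustrative only.

```python
def digest(seq: str, site: str, cut_offset: int = 0) -> list[str]:
    """Cut a linear DNA sequence at every occurrence of a recognition site.

    cut_offset is the position of the cut within the site
    (0 cuts immediately before the first base of the site).
    Returns the fragments in order from the start of the sequence.
    """
    seq = seq.upper()
    site = site.upper()
    cuts = []
    start = seq.find(site)
    while start != -1:
        cuts.append(start + cut_offset)
        start = seq.find(site, start + 1)
    bounds = [0] + cuts + [len(seq)]
    # Skip empty fragments that arise when a cut falls at either end
    return [seq[a:b] for a, b in zip(bounds, bounds[1:]) if b > a]


# Hypothetical sequence containing two GATC sites
fragments = digest("AAGATCTTTGATCGG", "GATC", cut_offset=0)
print(fragments, [len(f) for f in fragments])
```

Each fragment would then be ligated into the plasmid vector
and the resulting clones tested for activity, so that the
fragment sizes predicted in this way help in choosing an
enzyme that gives inserts of a workable length.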

The invention also discloses an isolated pesticidal agent 
characterised in that the agent is obtainable from 
cultures of X. nematophilus or variants thereof, has oral 
pesticidal activity against Pieris brassicae, Pieris 
rapae and Plutella xylostella, is substantially heat 
stable to 55°C, is proteinaceous, acts synergistically
with B. thuringiensis cells as an oral pesticide and is
substantially resistant to proteolysis by trypsin and 
proteinase K. 

By 'substantially heat stable to 55°C' is meant that the
agent retains some pesticidal activity when tested after
heating the agent in suspension to 55°C for 10 minutes,
and preferably retains at least 50% of the untreated
activity.

By 'substantially resistant to proteolysis' is meant that 
the agent retains some pesticidal activity when exposed
to proteases at 30°C for 2 hours and preferably retains
at least 50% of the untreated activity.

By 'acts synergistically' is meant that the activity of
the combination of components is greater than one might
expect from the use of the components individually. For
example, when used in conjunction with B. thuringiensis
cells as an oral pesticide, the concentration of B.
thuringiensis cellular material necessary to give 50%
mortality in P. brassicae when used alone is reduced by
at least 80% when it is used in combination with the agent
at a concentration sufficient to give 25% mortality when
the agent is used alone.

It has been found that the activity of the material is 
retained by 30 kDa cut-off filters but is only partly 
retained by 100 kDa filters. 

Preferably the agent is still further characterised in 
that the pesticidal activity is lost through treatment at 
25°C with sodium dodecyl sulphate (SDS; 0.1%, 60 mins)
and acetone (50%, 60 mins). 

Clearly the characterising properties of the isolated
agent described above can be utilised to purify it from,
or enrich its concentration in, Xenorhabdus species cells
and culture medium supernatants. Methods of purifying
proteins from heterogeneous mixtures are well known in the
art (e.g. ammonium sulphate precipitation, proteolysis,
ultrafiltration with known molecular weight cut-off
filters, ion-exchange chromatography, gel filtration,
etc.). The oral pesticidal activity provides a
convenient method of assaying the level of agent after
each stage, or in each sample of eluent. Such
methodology does not require inventive endeavour by those
skilled in the art.

The invention further discloses oral pesticidal
compositions comprising one or more agents as described
above. Such compositions preferably further comprise
other pesticidal materials from non-Xenorhabdus species.
These other materials may be chosen such as to have
complementary properties to the agents described above,
or act synergistically with them.

Preferably the oral pesticidal composition comprises one
or more pesticidal agents as described above in 
combination with B. thuringiensis (or with a toxin 
derived therefrom, preferably endotoxin). 

Recombinant DNA encoding said proteins also forms a
further aspect of the invention. The DNA may be
incorporated into an expression vector under the
influence of suitable control elements such as promoters,
enhancers, signal sequences etc. as is understood in the
art. These expression vectors form a further aspect of
the invention. They may be used to transform a host
organism so as to ensure that the organism produces the
toxin.

The invention further makes available a host organism
comprising a nucleotide sequence coding for a pesticidal
agent as described above.

Methods of cloning the sequence for a characterised
protein into a host organism are well known in the art.
For instance the protein may be purified and sequenced:
as activity is not required for sequencing, SDS gel
electrophoresis followed by blotting of the gel may be
used to purify the protein. The protein sequence can be
used to generate a nucleotide probe which can itself be
used to identify suitable genomic fragments from a
Xenorhabdus gene library. These fragments can then be
inserted via a suitable vector into a host organism which
can express the protein. The use of such general
methodology is routine and non-inventive to those skilled
in the art. Such techniques may be applied to the
production of Xenorhabdus toxins other than those encoded
by the sequence of Figure 2.

It may be desirable to manipulate (e.g. mutate) the agent
by altering its gene sequence (and hence protein
structure) such as to optimise its physical or
toxicological properties.

It may also be desirable for the host to be engineered or
selected such that it also expresses other proteinaceous
pesticidal materials (e.g. delta-endotoxin from B.
thuringiensis). Equally it may be desirable to generate
host organisms which express fusion proteins composed of
the active portion of the agent plus these other toxicity
enhancing materials.

A host may be selected for the purposes of generating
large quantities of pesticidal materials for purification
e.g. by using B. thuringiensis transformed with the
agent-coding gene. Preferably however the host is a plant,
which would thereby gain improved pest-resistance.
Suitable plant vectors, e.g. the Ti plasmid from
Agrobacterium tumefaciens, are well known in the art.
Alternatively the host may be selected such as to be
directly pathogenic to pests, e.g. an insect baculovirus.

The teaching and scope of the present invention embraces
all of these host organisms plus the agents, mutated 
agents or agent-fusion materials which they express. 

Thus the invention makes available methods, compositions,
agents and organisms having industrially applicable
pesticidal activity, being particularly suited to
improved crop protection or insect-mediated disease
control.

The methods, compositions and agents of the present
invention will now be described, by way of illustration
only, through reference to the following non-limiting
examples and figures. Other embodiments falling within
the scope of the invention will occur to those skilled in
the art in the light of these.

FIGURES

Figure 1 shows the variation with time of the growth of
X. nematophilus ATCC 19061 and activity of cells and
supernatants against P. brassicae as described in Example
3.

Figure 2 shows the sequence of a major part of a cloned
toxin gene from Xenorhabdus.

Figure 3 shows a comparison of the restriction maps of
cloned toxin genes from two strains of Xenorhabdus
(clone 1 above and clone 3 below).



EXAMPLES 


Example 1 - Use of X. nematophilus cells as an oral 
insecticide 

CELL GROWTH: A subculture of X. nematophilus (ATCC 19061,
Strain 9965 available from the National Collections of
Industrial and Marine Bacteria, Aberdeen, Scotland) was
used to inoculate 250 ml Erlenmeyer flasks each
containing 50 ml of Luria Broth containing 10 g tryptone,
5 g yeast extract and 5 g NaCl per litre. Cultures were
grown in the flasks at 27°C for 40 hrs on a rotary shaker.

PRODUCTION OF CELL SUSPENSION: Cultures were centrifuged
at 5000 x g for 10 mins. The supernatants were discarded
and the cell pellets washed once and resuspended in an
equal volume of phosphate buffered saline (8 g NaCl, 1.44 g
Na2HPO4 and 0.24 g of KH2PO4 per litre) at pH 7.4.
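
For convenience, the phosphate buffered saline recipe above
can be converted from grams per litre to approximate
millimolar concentrations. The sketch below assumes that the
anhydrous salts are meant, which the text does not state;
the molar masses are standard values for those anhydrous
forms.

```python
# Molar masses in g/mol, assuming anhydrous salts (an assumption;
# the recipe does not say which hydrate, if any, was weighed out).
MOLAR_MASS = {"NaCl": 58.44, "Na2HPO4": 141.96, "KH2PO4": 136.09}

# Recipe as given in the text, in grams per litre
RECIPE_G_PER_L = {"NaCl": 8.0, "Na2HPO4": 1.44, "KH2PO4": 0.24}


def millimolar(grams_per_litre: float, molar_mass: float) -> float:
    """Convert a mass concentration (g/l) to a molar concentration (mM)."""
    return 1000.0 * grams_per_litre / molar_mass


concentrations_mm = {
    salt: millimolar(grams, MOLAR_MASS[salt])
    for salt, grams in RECIPE_G_PER_L.items()
}

for salt, mm in concentrations_mm.items():
    print(f"{salt}: {mm:.2f} mM")
```

If a hydrated salt were used instead, the molar mass in the
table would need to be replaced by that of the hydrate and
the resulting concentration would be correspondingly lower.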

ACTIVITY OF CELL SUSPENSION TO INSECTS: The bioassays
were as follows: P. brassicae: The larvae were allowed
to feed on an artificial agar-based diet (as described by
David and Gardiner (1965) Nature (London) 207, 882-883)
into which a series of dilutions of cell suspension had
been incorporated. The bioassays were performed using a
series of 5 doses with a minimum of 25 larvae per dose.
Untreated and heat-treated (55°C for 10 minutes) cells
were tested. Mortality was recorded after 2 and 4 days
with the temperature maintained at 25°C.

                    LC50 (cells/g diet)
Treatment           2 days          4 days
Untreated           5.9 x 10^5      9.8 x 10^4
Treated 55°C        7.1 x 10^5      1.4 x 10^5
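
The text does not say how the LC50 values were derived from
the five-dose mortality data. As a sketch only, one simple
approach is to interpolate mortality linearly against the
logarithm of dose between the two doses that bracket 50%
mortality. The dose series and counts below are hypothetical
and are not the data behind the table above; a probit
analysis would be the more usual choice for real data.

```python
import math


def lc50_log_interpolation(doses, mortality):
    """Estimate LC50 by linear interpolation of mortality against log10(dose).

    doses: positive dose values; mortality: fraction killed (0 to 1) at each dose.
    Raises ValueError if no adjacent pair of doses brackets 50% mortality.
    """
    pairs = sorted(zip(doses, mortality))
    for (d_lo, m_lo), (d_hi, m_hi) in zip(pairs, pairs[1:]):
        if m_lo <= 0.5 <= m_hi and m_hi > m_lo:
            frac = (0.5 - m_lo) / (m_hi - m_lo)
            log_lo, log_hi = math.log10(d_lo), math.log10(d_hi)
            return 10 ** (log_lo + frac * (log_hi - log_lo))
    raise ValueError("50% mortality is not bracketed by the tested doses")


# Hypothetical five-dose series (cells/g diet) with 25 larvae per dose
doses = [1e4, 3e4, 1e5, 3e5, 1e6]
dead = [2, 6, 11, 19, 24]
mortality = [d / 25 for d in dead]

estimate = lc50_log_interpolation(doses, mortality)
print(f"estimated LC50: {estimate:.3g} cells/g diet")
```

Interpolating on a logarithmic dose scale reflects the fact
that the dilution series spans several orders of magnitude.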



Aedes aegypti: The larvae were exposed to a series of 5
different dilutions of cell suspension in deionised
water. The bioassays were performed using 2 doses per
dilution of 50 ml cell suspension in 9.5 cm plastic cups
with 25 second instar larvae per dose. Untreated and
heat-treated (55°C or 80°C for 10 minutes) cells were
tested. Mortality was recorded after 2 days with the
temperature maintained at 25°C.

                    LC50 (cells/ml)
Treatment           2 days
Untreated           5.1 x 10^6
Treated 55°C        7.4 x 10^6
Treated 80°C



Culex quinquefasciatus: The larvae were exposed to a
single concentration cell suspension containing 4 x 10
cells/ml. The bioassays were performed using two 50 ml
cell suspensions in 9.5 cm plastic cups with 25 second
instar larvae per cup. Untreated and heat-treated (55°C
or 80°C for 10 minutes) cells were tested. Mortality was
recorded after 2 days with the temperature maintained at
25°C.



                    % Mortality
Treatment           2 days
Untreated           100
Treated 55°C        100
Treated 80°C        0



Thus these results clearly show that cells from X.
nematophilus are effective as an oral insecticide against
a number of insect species (and are particularly potent
against P. brassicae). The insecticidal activity is not
dependent on cell viability (i.e. it is largely unaffected
by heating to 55°C, which reduces cell viability by >99.99%)
but is much reduced by heating to 80°C, which denatures
most proteins.

Example 2 - Use of X. nematophilus supernatant as an oral 
insecticide



CELL GROWTH: Cultures were grown as in Example 1. 

PRODUCTION OF SUPERNATANT: Cultures were centrifuged 
twice at 10000 x g for 10 mins. The cell pellets were
discarded. 



ACTIVITY OF SUPERNATANT TO INSECTS: The bioassay was as
follows:
Activity against neonate P. brassicae and two day old
Pieris rapae and Plutella xylostella larvae was measured
as for P. brassicae in Example 1, but using a series of
untreated dilutions of supernatant in place of cell
suspensions and with mortality being recorded after 4 days
only.

                    LC50 (µl supernatant/g diet)
Insect species      4 days
P. brassicae        22
P. rapae            79
P. xylostella       135

In addition, size-reducing activity (62% reduction in 7 
days) against Mamestra brassicae was detected in larvae 
fed on an artificial diet containing X. nematophilus
supernatant (results not shown). 

Thus these results clearly show that the supernatant from
X. nematophilus culture medium is effective as an oral
insecticide against a number of insect species, and is
particularly potent against P. brassicae.

The heating of supernatants to 55°C for 10 minutes caused
a partial loss of activity while 80°C caused complete
loss of activity. Activity was also completely lost by
treatment with SDS (0.1% w/v for 60 mins) and acetone (50%
v/v for 60 mins) but was unaffected by Triton X-100 (0.1%,
60 mins), Nonidet P40 (0.1%, 60 mins), NaCl (1 M for 60
mins) or cold storage at 4°C or -20°C for 2 weeks. All
of these properties are consistent with a proteinaceous
agent.

The general mode of action of X. nematophilus cells and
supernatants, i.e. reduction in larval size and death
within 2 days at high dosages, and other properties, e.g.
temperature resistance, appear to be similar, suggesting a
single agent or type of agent may be responsible for the
oral insecticidal activities of both cells and
supernatants.

Example 3 - Timescale for appearance of ingestable 
insecticidal activity 

CELL GROWTH: 1 ml of an overnight culture of X.
nematophilus was used to inoculate an Erlenmeyer flask.
Cells were then cultured as in Example 1. Growth was 
estimated by measuring the optical density at 600 nm. 

PRODUCTION OF CELL SUSPENSION AND SUPERNATANTS: These
were produced as in Examples 1 and 2.

ACTIVITY OF CELLS AND SUPERNATANTS AGAINST P. BRASSICAE:
The cell suspension bioassay was carried out as in
Example 1, but using a single dose of suspended cells
equivalent to 50 µl of broth/g diet and measuring
mortality after 2 days. The cell supernatant bioassay
was carried out as in Example 2, but using a single dose
equivalent to 50 µl supernatant/g diet (i.e. more than
twice the LC50) and measuring mortality after 2 days.

The results are shown in Fig. 1. Thus these results 
clearly show that cells taken from X. nematophilus
culture medium are highly effective as an oral 
insecticide against P. brassicae after only 5 hours, and 
supernatants are highly effective after 20 hours. 
Although some slight cell lysis was observed in the early 
stages of growth, no significant cell lysis was observed 
after this point demonstrating that the supernatant 
activity may be due to an authentic extracellular agent 
(as opposed to one released only after cell breakdown). 

Example 4 - Synergy between X. nematophilus cells and
B. thuringiensis powder preparations

CELL GROWTH AND SUSPENSION: X. nematophilus cells were
grown and suspended as in Example 1. B. thuringiensis
strain HD1 (from Bacillus Genetic Stock Centre, The Ohio
State University, Columbus, Ohio 43210, USA) was
cultured, harvested and formulated into a powder as
described by Dulmage et al. (1970) J. Invertebrate
Pathology 15, 15-20.

ACTIVITY OF X. NEMATOPHILUS CELLS AND B. THURINGIENSIS
POWDER AGAINST P. BRASSICAE: The bioassays were carried
out using X. nematophilus and B. thuringiensis in
combination or using B. thuringiensis cell powder alone.
Bioassays were carried out as in Example 1 but with
various dilutions of B. thuringiensis powder in place of
X. nematophilus. For the combination experiment, a
constant dose of X. nematophilus cell suspension
sufficient to give 25% mortality was also added to the
diet. Mortality was recorded after 2 days.

                                LC50 (µg Bt powder/g diet)
Bioassay                        2 days
B.t. alone                      1.7
B.t. plus X. nematophilus       0.09

These results clearly demonstrate the synergism between 
X. nematophilus cells and B. thuringiensis powder when 
acting as an oral insecticide against P. brassicae.

Example 5 - Synergy between X. nematophilus
supernatants and B. thuringiensis powder

CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X.
nematophilus cells were grown and supernatants prepared
as in Example 2. B. thuringiensis was grown and treated
as in Example 4.

ACTIVITY OF X. NEMATOPHILUS SUPERNATANTS AND Bt CELL
POWDER AGAINST P. BRASSICAE:

The bioassays were carried out using X. nematophilus
supernatants and B. thuringiensis in combination or using
B. thuringiensis powder alone. Activity against
neonate P. brassicae and two day old Pieris rapae and
Plutella xylostella larvae was measured as in Example 2
but with various dilutions of B. thuringiensis in place
of X. nematophilus. For the combination experiment, a
constant dose of X. nematophilus supernatant sufficient
to give 25% mortality was also added to the diet.
Mortality was recorded after 4 days.

                    LC50 (µg Bt powder/g diet)
Insect species      Bt alone      Bt plus Xn
P. brassicae        1.4           0.12
P. rapae            2.5           0.26
P. xylostella       7.2           0.63

These results clearly demonstrate the synergism between
X. nematophilus supernatants and B. thuringiensis powder
when acting as an oral insecticide against several insect
species. The fact that both X. nematophilus cells and
supernatants demonstrate this synergism strongly suggests
that a single agent or type of agent is responsible for
the demonstrated activities.
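
The synergy criterion given earlier in the description (a
reduction of at least 80% in the B. thuringiensis LC50 when
the agent is present) can be checked against the LC50 values
tabulated in this Example with a simple calculation. The
sketch below uses only those tabulated values; the function
and variable names are illustrative.

```python
# LC50 values in µg Bt powder/g diet from the table in this Example:
# (Bt alone, Bt plus X. nematophilus supernatant)
LC50_UG_PER_G = {
    "P. brassicae": (1.4, 0.12),
    "P. rapae": (2.5, 0.26),
    "P. xylostella": (7.2, 0.63),
}


def lc50_reduction_percent(alone: float, combined: float) -> float:
    """Percentage by which the LC50 falls when the second component is added."""
    return 100.0 * (alone - combined) / alone


def meets_synergy_criterion(alone: float, combined: float,
                            threshold: float = 80.0) -> bool:
    """True if the LC50 reduction reaches the stated threshold (default 80%)."""
    return lc50_reduction_percent(alone, combined) >= threshold


for species, (alone, combined) in LC50_UG_PER_G.items():
    reduction = lc50_reduction_percent(alone, combined)
    verdict = meets_synergy_criterion(alone, combined)
    print(f"{species}: {reduction:.1f}% reduction, criterion met: {verdict}")
```

On these figures the reduction is roughly 90% for each of
the three species, which is above the 80% threshold.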

Example 5 - Characterisation of insecticidal agent from
X. nematophilus supernatant by proteolysis 

CELL GROWTH AND PRODUCTION OF SUPERNATANTS: X. 
nematophilus cells were grown and supernatants prepared 
as in Example 2.

PROTEOLYSIS OF SUPERNATANT: Culture supernatant (50 ml)
was dialysed against 0.5 M NaCl (3 x 1 l) for 48 hours at
4°C. The volume of the supernatant in the dialysis tube
was reduced five-fold by covering with polyethylene
glycol 8000 (Sigma Chemicals). Samples were removed and
treated with either trypsin (Sigma T8253, 10,000
units/mg) or proteinase K (Sigma P0390, 10 units/mg) at
a concentration of 0.1 mg protease/ml sample for 2 hours
at 30°C.

ACTIVITY OF PROTEASE TREATED SUPERNATANT AGAINST P.
BRASSICAE: The bioassay against neonate P. brassicae
larvae was carried out by spreading 25 µl of each
'treatment' on the artificial agar-based diet referred to
in Example 1 in a 4.5 cm diameter plastic pot. Four pots
each containing 10 larvae were used for each treatment.
Mortalities were recorded after 1 and 2 days. Controls
using water only, trypsin (0.1 mg/ml) and proteinase K
(0.1 mg/ml) were also tested in the same way.





                                      % Mortality

Treatment                             1 day     2 days

Untreated supernatant                 60        100
Proteinase K treated supernatant      45        100
Trypsin treated supernatant           40        100
All controls (no supernatant)         0         0
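
The percentages in the proteolysis table can be related back to the bioassay design of four pots of ten larvae per treatment. The following is a minimal sketch under that assumption; the names dead_larvae and relative_to_untreated are hypothetical helpers introduced here for illustration and are not taken from the original text.

```python
LARVAE_PER_TREATMENT = 4 * 10  # four pots of ten larvae, as described for the bioassay

# Day 1 mortality (%) transcribed from the table above.
MORTALITY_DAY1 = {
    "Untreated supernatant": 60,
    "Proteinase K treated supernatant": 45,
    "Trypsin treated supernatant": 40,
    "All controls (no supernatant)": 0,
}


def dead_larvae(percent: float, n: int = LARVAE_PER_TREATMENT) -> int:
    """Convert a percentage mortality into a count of dead larvae."""
    return round(percent / 100 * n)


def relative_to_untreated(percent: float, untreated_percent: float) -> float:
    """Day 1 mortality of a treatment as a fraction of the untreated value."""
    return percent / untreated_percent


if __name__ == "__main__":
    untreated = MORTALITY_DAY1["Untreated supernatant"]
    for treatment, percent in MORTALITY_DAY1.items():
        print(treatment, dead_larvae(percent), "of", LARVAE_PER_TREATMENT,
              f"({relative_to_untreated(percent, untreated):.2f} of untreated)")
```

Read this way, both protease treatments retain most of the day 1 activity and all of the day 2 activity, which is the basis for describing the agent as substantially resistant to proteolysis.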



Example 6

Entomocidal activity of other Xenorhabdus

Using the methodology of Examples 1 and 2, four different
Xenorhabdus strains were tested against insect pests.
The results obtained were as follows:
I) Activity to Pieris brassicae

Strain deposit     Cells 10^[illegible]/gram diet     Supernatant LC50
no/code            % mortality                        µl/gram of diet

[illegible]        100                                0.09
0014               100                                0.52
0015               80                                 3.73
NCIMB 40886        100                                0.05


It was found that entomocidal activity of cells and
supernatant was reduced by more than 99% when all four
strains were heated at 80°C for 10 minutes.

II) Activity to mosquitoes (Aedes aegypti)
Bacteria added at the rate of 10^7 cells/ml of water

Strain deposit     Cells 10^[illegible]/gram diet
no/code            % mortality

[illegible]        0
0014               40
0015               45
NCIMB 40886        95



Furthermore, all strains significantly reduced the growth
of Heliothis virescens.

Example 7

Cloning of toxin genes from strains of Xenorhabdus

Total cellular DNA was isolated from NCIMB 40887 and ATCC
19061 using a Qiagen genomic DNA purification kit.
Cells were grown in L broth (10 g tryptone, 5 g yeast
extract and 5 g NaCl per l) at 28°C with shaking (150 rpm)
to an optical density of 1.5 A600. Cultures were
harvested by centrifugation at 4000 x g and resuspended in
3.5 ml of buffer B1 (50 mM Tris/HCl, 0.05% Tween 20, 0.5%
Triton X-100, pH 7.0) and incubated for 30 minutes at 50°C.
DNA was isolated from bacterial lysates using Qiagen
100/G tips as per the manufacturer's instructions. The
resulting purified DNA was stored at -20°C in TE buffer
(10 mM Tris, 1 mM EDTA, pH 8.0).

A representative DNA library was produced using total DNA
of NCIMB 40887 and ATCC 19061 partially digested with the
restriction enzyme Sau3A. Approximately 20 µg of DNA from
each strain was incubated at 37°C with 0.25 units of the
enzyme. At time intervals of 10, 20, 30, 45 and 60
minutes, samples were withdrawn and heated at 65°C for 15
minutes. To visualise the size of the DNA fragments, the
samples were electrophoresed on 0.5% w/v agarose gels.
The DNA samples which contained the highest proportion of
30 to 50 kb fragments were combined and treated with 4
units of shrimp alkaline phosphatase (Boehringer) for 15
minutes at 37°C, followed by heat treatment at 65°C to
inactivate the phosphatase.
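
The reason a partial rather than a complete digest is used can be illustrated with a back-of-envelope calculation. The sketch below assumes equal and independent base frequencies, which is a simplification and not a property claimed for Xenorhabdus DNA, and takes the Sau3A recognition site as the four-base sequence GATC. The function names are hypothetical.

```python
def expected_site_spacing(site_length: int) -> int:
    """Mean spacing (bp) between occurrences of a fixed site,
    assuming all four bases are equally likely and independent."""
    return 4 ** site_length


def sites_per_fragment(fragment_bp: int, site_length: int = 4) -> float:
    """Approximate number of recognition sites lying within one fragment."""
    return fragment_bp / expected_site_spacing(site_length)


if __name__ == "__main__":
    spacing = expected_site_spacing(4)  # GATC is a four-base site
    print(f"Expected spacing of a four-base site: {spacing} bp")
    for size in (30_000, 50_000):
        print(f"{size} bp fragment: about {sites_per_fragment(size):.0f} sites left uncut")
```

Under these assumptions a complete digest would give fragments of a few hundred base pairs, so a 30 to 50 kb fragment must retain on the order of one to two hundred uncut sites, which is why a low enzyme dose and a time course are used.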

The size-selected DNA fragments were ligated into the
BamHI site of the cosmid vector SuperCos 1 (Stratagene)
and packaged into the Escherichia coli strain XL1 Blue,
using a Gigapack II packaging kit (Stratagene) in
accordance with the manufacturer's instructions.

To select for cosmid clones with entomocidal activity,
individual colonies selected on L agar plates containing
25 µg/ml ampicillin were grown in L broth (containing
25 µg/ml ampicillin) overnight at 28°C. Broth cultures
(50 µl) were individually spread onto the surface of
insect diet contained in 4.5 cm diameter pots, as
described in Example 5. To each container 10 neonate P.
brassicae larvae were added. Larvae were examined after
24, 72 and 96 hours, recording mortality and size of
surviving larvae. A total of 220 clones of NCIMB 40887
were tested, of which two were found to cause reduction
in larval growth and death within 72 hours. Of 370
clones from ATCC 19061, one was found to cause larval
death within 72 hours.
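
The scale of the screen can be summarised from the counts given above. This is an illustrative sketch only; the dictionary SCREEN and the helper hit_rate_percent are hypothetical names, and the figures are those stated in the text.

```python
# Clones tested and active clones found, as stated in the text.
SCREEN = {
    "NCIMB 40887": {"tested": 220, "active": 2},
    "ATCC 19061": {"tested": 370, "active": 1},
}
LARVAE_PER_CLONE = 10  # ten neonate larvae were added to each pot


def hit_rate_percent(tested: int, active: int) -> float:
    """Percentage of screened clones showing entomocidal activity."""
    return 100.0 * active / tested


if __name__ == "__main__":
    for strain, counts in SCREEN.items():
        rate = hit_rate_percent(counts["tested"], counts["active"])
        print(f"{strain}: {counts['active']}/{counts['tested']} clones active ({rate:.2f}%)")
    total_clones = sum(c["tested"] for c in SCREEN.values())
    print("Larvae used in the screen:", total_clones * LARVAE_PER_CLONE)
```

Fewer than one clone in a hundred was active in either library, so each pot had to be scored individually.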

Example 8

Activity of cloned toxin genes to Pieris brassicae

The three active clones from Example 7 were grown in L
broth, containing 25 µg/ml ampicillin, for 24 hours at
28°C, on a rotary shaker at 150 rpm. The activity of the
toxin clones against neonate larvae was determined by
incorporation of whole broth cultures into insect diet,
as described in Example 1.

Clone No    Strain         LC50 (µl broth/g insect diet)

1           NCIMB 40887    13.03
2           NCIMB 40887    16.7
3           ATCC 19061     108.7
Control*                   No effect at 100 µl/g

*XL1 Blue E. coli broth

When E. coli toxin clones were heated at 80°C for 10
minutes and added to the diet at a rate of 100 µl/g, no
activity to larvae was detected, highlighting the heat
sensitivity of the toxins.

Example 9

Sequencing of the cloned toxin from NCIMB 40887

Cosmid DNA of the entomocidal clone 1 above from NCIMB
40887 was purified using the Wizard Plus SV DNA system
(Promega) in accordance with the manufacturer's
instructions. A partial map of the cloned fragment was
obtained using a range of restriction enzymes, EcoRI,
BamHI, HindIII, SalI and SacI, as shown in Figure 3. DNA
sequencing was initiated from pUC18 and pUC19 based
sub-clones of the cosmid, using the enzymes EcoRI, BamHI,
HindIII, EcoRV and PvuII. Sequence gaps were filled
using a primer walking approach on purified cosmid DNA.
Sequence reactions were performed using the ABI PRISM™
Dye Terminator Cycle Sequencing Ready Reaction Kit with
AmpliTaq DNA polymerase FS according to the
manufacturer's instructions. The samples were analysed on
an ABI automated sequencer according to the manufacturer's
instructions. The major part of the DNA sequence for the
cloned toxin fragment is shown in Figure 2.
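
Once sequence is available, the positions of the mapping enzymes' recognition sites can be located directly. The sketch below is an illustration and not the procedure used to produce Figure 3: it searches a single 60-base line of Figure 2 (bases 121 to 180) for the standard recognition sequences of the five mapping enzymes named above. The function name find_sites is hypothetical.

```python
# Standard recognition sequences of the mapping enzymes named in Example 9.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "SalI": "GTCGAC",
    "SacI": "GAGCTC",
}

# Bases 121-180 as printed in Figure 2.
LINE_121 = "AGCGTTAACG GTAAAGAGTC ATTCGGATCC GCTGTATCGC TTTTGTGGTT ATCTGGTGTC"


def find_sites(sequence: str, first_position: int = 1) -> dict:
    """Return the 1-based start positions of each recognition site."""
    seq = "".join(sequence.split()).upper()  # drop spaces between ten-base blocks
    hits = {}
    for enzyme, site in SITES.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(first_position + start)
            start = seq.find(site, start + 1)  # allow overlapping matches
        hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for enzyme, positions in find_sites(LINE_121, first_position=121).items():
        print(enzyme, positions)
```

Applied to this one line, the search reports a single BamHI site and none for the other four enzymes; a full map would require the complete, verified sequence.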
Example 10

Restriction map of cloned toxin from clone 3

Cosmid DNA of the entomocidal clone 3 above was purified
as described in Example 9. A restriction map of the
cloned fragment was obtained using the restriction
enzymes BamHI, HindIII, SalI and SacI and this is shown
in Figure 3. When compared with the map from clone 1
(Figure 3) it is clear that over the regions which
overlap, the restriction maps are very similar. The
only detectable difference between the two clones was a
reduction in size of two HindIII fragments in clone 3,
corresponding to the 11.4 kb and 7.2 kb HindIII fragments
in clone 1, by approximately 2 kb and 200 bp respectively.
These results indicate the overall relatedness of the DNA
region coding for toxicity in the two bacterial strains.
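
The size differences described above imply approximate HindIII fragment sizes for clone 3. The following sketch simply carries out that subtraction; the variable names are hypothetical, and the results are only as precise as the "approximately" in the text.

```python
# HindIII fragment sizes in clone 1 (kb) and the reported reductions in clone 3 (kb).
CLONE1_FRAGMENTS_KB = {"larger fragment": 11.4, "smaller fragment": 7.2}
REDUCTION_KB = {"larger fragment": 2.0, "smaller fragment": 0.2}


def predicted_clone3_fragments() -> dict:
    """Approximate clone 3 fragment sizes: clone 1 size minus the reported reduction."""
    return {
        name: round(size - REDUCTION_KB[name], 1)
        for name, size in CLONE1_FRAGMENTS_KB.items()
    }


if __name__ == "__main__":
    for name, size in predicted_clone3_fragments().items():
        print(f"{name}: about {size} kb in clone 3")
```

The larger predicted fragment, about 9.4 kb, is of the same order as the fragment of approximately 9 kb detected in ATCC 19061 by the Southern hybridisation of Example 11.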

Example 11

Southern Blot Hybridisation Experiments

A 10.3 kb BamHI-SalI fragment of the DNA from clone 1 was
used as a probe to hybridise to total HindIII-digested DNA
of the Xenorhabdus strains ATCC 19061, NCIMB 40886 and
NCIMB 40887. Hybridisation was performed with 20 ng/ml of
DIG-labelled DNA probe at 65°C for 18 hours. Filters
were washed prior to immunological detection twice for 5
minutes with 2 x SSC (0.3 M NaCl, 30 mM sodium citrate, pH
7.0)/0.1% (w/v) sodium dodecyl sulphate at room
temperature, and twice for 15 minutes with 0.1 x SSC
(15 mM NaCl, 1.5 mM sodium citrate, pH 7.0) plus 0.1%
sodium dodecyl sulphate at 65°C. The probe was labelled
and experiments performed in accordance with the
manufacturer's instructions, using a non-radioactive DIG
DNA labelling and detection kit (Boehringer). The probe
hybridised to a HindIII fragment of approximately 8 kb in
all three strains as well as an 11.4 kb fragment in NCIMB
40887 and a fragment of approximately 9 kb in both NCIMB
40886 and ATCC 19061. These results show that strains NCIMB
40886 and ATCC 19061 contain DNA with close homology to
the toxin gene of clone 1 above, confirming the
similarity between the toxins produced by the three
strains.
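
The salt concentrations quoted for the two wash buffers follow from dilution of a concentrated SSC stock. The sketch below assumes the conventional 20 x SSC stock of 3 M NaCl and 0.3 M sodium citrate, which is not stated in the text; the function name ssc_components_mm is hypothetical.

```python
# Assumed 20 x SSC stock (conventional recipe), expressed in mM.
STOCK_20X_MM = {"NaCl": 3000.0, "sodium citrate": 300.0}


def ssc_components_mm(strength: float) -> dict:
    """Concentration (mM) of each component in SSC of the given strength,
    for example strength=2 for 2 x SSC."""
    return {
        name: round(conc * strength / 20.0, 6)
        for name, conc in STOCK_20X_MM.items()
    }


if __name__ == "__main__":
    for strength in (2, 0.1):
        print(f"{strength} x SSC:", ssc_components_mm(strength))
```

With that stock, 2 x SSC contains 300 mM NaCl and 30 mM sodium citrate, and 0.1 x SSC contains 15 mM NaCl and 1.5 mM sodium citrate, in agreement with the values given for the low and high stringency washes.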
CLAIMS

1. An insecticidal composition adapted for oral
administration to an insect comprising a pesticidal
material obtainable from a Xenorhabdus species, or a
pesticidal fragment thereof, or a pesticidal variant or
derivative of either of these.

2. A composition according to claim 1 wherein the said
pesticidal material comprises material encoded by the
nucleotide sequence of Figure 2 or variant or fragment
thereof, or a sequence which hybridises with said
sequence.

3. A composition according to claim 1 or claim 2 which
comprises cells of Xenorhabdus.

4. A composition as claimed in any one of the
preceding claims which comprises supernatant taken from
cultures of cells of Xenorhabdus species.

5. A composition according to any one of the preceding
claims wherein the Xenorhabdus species is Xenorhabdus
nematophilus.

6. A composition according to any one of claims 1 to 4
wherein the Xenorhabdus species is ATCC 19061, NCIMB
40886 or NCIMB 40887.

7. A composition as claimed in any one of the preceding
claims which comprises a further pesticidal material not
obtainable from Xenorhabdus.

8. A composition according to claim 7 wherein the said
further pesticidal material comprises a material
obtainable from B. thuringiensis.

9. A composition according to claim 8 which further
comprises cells of B. thuringiensis.

10. A composition according to claim 8 wherein the
pesticidal material obtainable from B. thuringiensis
comprises the delta endotoxin.

11. A composition according to any one of the preceding
claims which further comprises an agriculturally
acceptable carrier.

12. A composition according to claim 10 wherein the
carrier comprises items of insect diet.

13. A method for killing or controlling insect pests,
which method comprises administering to a pest or the
environment thereof a composition according to any one of
the preceding claims.

14. A method as claimed in claim 12 wherein the pests
are insects from the order Lepidoptera or Diptera.

15. A microorganism comprising Xenorhabdus strain NCIMB
40886.

16. A microorganism comprising Xenorhabdus strain NCIMB
40887.

17. A pesticidal agent which comprises a toxin
comprising a protein which is encoded by DNA which
includes SEQ ID No. 1 or a variant or fragment thereof.

18. An isolated pesticidal agent characterised in that
it is obtainable from cultures of X. nematophilus or
mutants thereof, has oral pesticidal activity against
Pieris brassicae, Pieris rapae and Plutella xylostella,
is substantially heat stable to 55°C, is proteinaceous,
acts synergistically with B. thuringiensis cells as an
oral pesticide, and is substantially resistant to
proteolysis by trypsin and proteinase K.

19. An isolated pesticidal agent as claimed in claim 18
further characterised in that the pesticidal activity is
substantially destroyed by treatment with sodium dodecyl
sulphate or acetone or heating to 80°C.

20. An isolated pesticidal agent as claimed in claim 18
or claim 19 further characterised in that the agent is an
extracellular protein.

21. A recombinant DNA which encodes a pesticidal agent
according to any one of claims 17 to 20.

22. A recombinant DNA of claim 21 which comprises the
sequence of Figure 2 or a variant or fragment thereof.

23. A recombinant DNA which comprises or hybridises
under stringent conditions with all or part of the
sequence of Figure 2, and which encodes a pesticidal
material.

24. An expression vector comprising a recombinant DNA
according to any one of claims 21 to 23.

25. A host organism which has been transformed with an
expression vector according to claim 24.

26. A host organism as claimed in claim 25 which has been
engineered or selected such that it also expresses other
pesticidal proteinaceous toxicity enhancing materials.

27. A host organism comprising a nucleotide sequence
coding for a fusion protein comprising a pesticidally
active portion of an agent as claimed in any one of
claims 17 to 20 in combination with other pesticidal
proteinaceous toxicity enhancing materials.

28. A host organism as claimed in claim 27 wherein the
pesticidal toxicity enhancing materials comprise delta-
endotoxin from B. thuringiensis.

29. A host organism as claimed in any one of claims 25 to
28 wherein the host is a plant.

30. A host organism as claimed in any one of claims 25 to
28 wherein the host is a virus pathogenic to insects.

31. A fusion protein as expressed by a host as claimed in
claim 27.

32. A pesticidal composition comprising one or more
agents as claimed in any one of claims 17 to 20.

Fig.2.

1 TCCACAATTG CCGGAGAAAA TCAG7CGGGA ACTGCCGGTG ATTATTCGTC ACTTATTAAA 
61 CGAATTTGCC GACCAGAATA AGGCTAAAAA ACTGCTACAG GCGCAACGCG ACTCGAACGA 
121 AGCGTTAACG GTAAAGAGTC ATTCGGATCC GCTGTATCGC TTTTGTGGTT ATCTGGTGTC 
181 TGTCAATGAT ATGACCGGAA TGAAGATGGG CAATAAAAAC ATTAGCCCAC GAGCACCGAG 
241 ATTGTACTTG TATCATGCCT ATCTCTCTTT TATGGAAGCG CACGGCTT7G AACGTCCGTT 
301 AACACTGACT AAGTTTGGTG AATCCATCCC CAAGATTATG CTGGAATACC GGAAGGAGTA 
361 TCGAAAAGTG CGAACCAAGA AAGGCTATTC CTATAACGTG GAATTATCGG AAGAGGCCGA 
421 AGAATGGCTA CCGTCAGTGC CTGAGTGTCG AGACTTTAAA TCACCTGTAT AAAACTTTGA 
481 GCTTTAAGTC TGCACTCCAT ACACAACTTA AAATATCTAA TTGTATTTAA AAGAAAATAA
541 TAGATGTATA GTTATTTTTT AACTATACAT AAGCTCTACA TGCTCTTCAT TCGTGTAAAA 
601 AATGGGTGAA CAGGTGATAC AGTCAGTGAA TATCATATTA ATTACCGTAA ACCCAGATGT 
661 AGCAAGGCTT TCAGGGAATT GTGCAGAGGG TGCATAACTG AGAGGGTGAA AAAGATTTTC 
721 AGGGGGGCTT ATGGCAGGTA AACAAAATCA GAAGCAAATA CCGTGCACAA TCTGG TTTTT 
781 ATTTTTTGGT ACTACCTCAA ATTAAAATGA TGTAATCATC TGATTTTATT TAAGAATAGA 

841 AGTTAATCAC AATTTCATTG ATGGACTTTC ATTCACACTG GTATAGATAA ATAATTCTGT

901 TATATCCTGT TTCATTACGC ATTCATCAGG AGTGCTGTTA CAGGAGACAA GAATGTCACA 
961 CATCATTTAC TTGTCGTTAA AGGGCAAGAA GCAGGGTTTA ATTTCAGCGG GTTGTTCAAC 

1021 GCCTGAATCA ATTGGAAATC GCTATCAAAA AGGACGTGAA GATCAAATAC AGGTATTGAG 

1081 CCTGAATCAT TCGATGAGCC GTGACCAGAA TGTTAATCAT CAACCCGTCA GTTTTGTGAA

1141 ACCCATTGAT AAATCCTCTC CCCTGTTTGC TGGATGCCAG TTTTGTGCAT 7ACAGGACAA 

1201 GCCAGATGGG ACAACTGGAG TTCTTTTATG AAATCAAGCT GACCAGTGCC ACGATTGTGG 

1261 ATATTTCCTA TAATTATCCG GCATTCAATC AA7GA7AA7G GTGCGATACC CCATGAAGTG 

1321 GTGATGCTCG ATTATAAGTC CATTTCATGC AACCACATCG CCGCAGGACT TCGGGCTACA 

1381 GCATACGCAA TTAGCCGGAA GTGAAGAAGC AAGCCGCTTT TATCTGGGGT CTCGAATGTT 

1441 AAGCCACTTA AGAAGCCGCT GGTTGAAGAA ACCCCGGTAA AACCCGCTAA ACATCATGCC 

1501 CGTTATCGTT GTGTGGATGA TGACGGCAAT CTTTTAA CCG AACGCAAGTA TCGGGTTTGC 

1561 CTGCCGGATG GTCAGATAAA AGAAGGAAAG ACTGATAAAC AAGGTTACAC CCAATGGCAT 

1621 CTTACGGATG ACAAAAATAA ACTTGAATTT CATA7TTTAA AGGATTAATA CCATGCCAGC 

1681 CTATACCGTT CAGACAAAAA TAGAA7CCAA CG7AGC7GT7 GAAAACCTGC TTTACGACTT 

1741 AACCATTTAT CGTAAGGATG CAAAAGGAAA TTTCCATATG TTGCTTGATG TTTTTCAGGA 

1801 GAAACTACAG AGTAATTATG AAACACAACA GCA7A7CACG CAGGAAATAG ACGACGATCT 

1861 TTCTGTGATT TATATTATGC AAATTATGCT TCACCGCAAA CATGGCTCAA ATATATTTCC

1921 GGCAGTGCAA ACCCATTTTA AGAAAATGTA TACGCTCGGT GAATTAACTT CCGGTAAAGC 

1981 CTGTTCGGAG AAAAAACGGG AAAATGCCTG TTATTTTGAA AGTACAGTTG AAACAAAACC 

2041 TGTCAGCGAC GGGGATAATA CCGTTGACTT AAA7A7CACT ATTCCTGAAC GAC CTTTT AT 

2101 TGCCAAAGAA TATCCCATTG GTCACCCACA CGA7CCATTT GAAAAAAGTA AAATTGAATC 

2161 ATAAATACAG GACAGGTTAT CGAAAAGAAT 7TA7CCGGA7 CAAAATGGAG CAAGTTTATG 

2221 TCAGGGCGCG AGCACACTAT TTTAGCTGCG 77777AAGA7 GATTATCTCT TAATGTTCAG 

2281 TTTTAATAGT GTTTTTATCG AGTGAAATTT AATCGCACAG GCAATTCTTT AGA CTTTTA T 

2341 AGAAAACTAA AGAATTAAAG AACAAGATTG AC ATTTTAA G TTCAAATATT AATCAAAGTA 

2401 TGCTCGCGCC CTGAGTTTAT GTGGCCCTGC CGCTTTTTTT TATTGCCTGC CAATAGATAG 

2461 ACCAGATATT TATGAGCAAG CGGCACGAGA ATTATGGCAA TATGGCCGAA CTAAAATTGG 

2521 TCAACTGGAA ATTAAGCCGG GTGAGGGTTG CCGACA7CCT AAAGGTACT7 TTTATAATCA 

2581 ATATGGTGAA AGAATATCTG GGTTAGA77G GC7GACA77G GCAAGCCTAA GAGATTCAGA 

2641 AAATATGATG ATGAGGTTGA TGATGAAGTA GC7GG7A77A CAATGTGGGG AAAA77GACA 

2701 GAATGGTTTG AAAAATCAGG GTATGAAAAA G7A777AG7A ATGTCGGCT7 ATCCCAT7C7 

2761 AATA7AAATG ACATAGTAAC TCTTAGTGA7 TAC7A7AACA AAGGATATCA TGTTG7TAC7 

2821 7TGATT7CAG CAGGAATGTT A7CAGA777T GG7GACA7AG AAACA7CAGG AAAAAA7CAT 

2881 7GGA7AGTTT GGG AAGGAGT AGTAGAAAAC 7A7GAGAAAG AAAA7ATCAC AAA7AATTCA 

2941 GATC7GAATC AATATGTAAA 777AAA7CTG 7TT7CATGGG GTAAAGTGGA ACA7CAAATT 

3001 AAAAAAAACA AATCAC7AGA TTATGTACTC AACCATA7TT TTTGAGGGT7 GG TTT7T AAA 

3061 CCAA7GAAA7 AACATGAAAA AAA7ATTAA7 7A77TTTA77 TTTITACTTT ATGG77G7GG 

3121 TAATCCAACG CCAAAAG7T7 7ACCAAAATC AGAG777C77 CC7GA7GCAG TGA7AAA7GA 

3181 ACCA7A7CAG GCA7CAAT7A CCA7CACAGG AGG7GCA77G AATGAAAAAA GCG777GGG7 

3241 AAAAATTCAT CC7AC7GGC7 CAGGACTAAC A7GGAA7CCA AAAGATAGT7 CTT7CC7A7A 

3301 GGG7GGAAAA AAAGAAA7AA GAAAAGA77A TCATCA7ATA AA7A7AACAG GTACCCCAAA 

3361 GAAGA2AGAA 7TGATAAAAA TTGAAGTGG7 AGGA7TTACA 7TGGG7ACAA 7G7ACGCA CG 

3421 GAAAGAG7TC AC7ATA AAT7 A7AC7ATAAA AG7AAGGGAA 7AA7TGTCAC 7ATCAGAA7G 

3481 G7GA7TTAAT TCGCCAT777 7A7ACT777G TA7AC7C7CT CAACA7AATC AGGAT7C7TT 
3541 CTTATTATTT TTCATGGTGC TAAAAACGTT TATTGCAAAA ATAAATTAAG TTAATCAGAT

3601 AAATTATCTG CATTACTGTT ATAATCGATA ACACGATAAC CTGACTTTCT GCCTGTTCTT 

3661 ATGAACTCGA AGATAATCCT TTCTGAGCCT GAACGAATCA CATTGCAACC ACTCGCTTTG 

3721 AATCACCCAC ACCGGGACAT TCGTACGCGA GGAACGGGTT TACTCATGCT TGCCAGAGGG 

3781 AGCAAGGCGT CCCAGATCAC CGCTGAAATC GGATGCAGTC TCCGGGTTAT CTGTAATTGG 

3841 GTTCACATGT GGCACAGATA GCGGGATTAT TCGGCGGTCA TGCCGGAGGC CGGTATCTCG 

3901 CCATGACGCC TGACATGATT GCCACTGCGC TCGAAGCCGC CAGCGCAGAG TCCCTGACGT

3961 GCGTCGAAGC CAGGCAGGGT TTCCCTGCCT TGTACGCTTG AAACGCTGGC GAATACCCTC 

4021 AAAAAACAGG GGCTCCCCTA TAAACGCCCC CGCCTGTCGC TTAAAAAAAG CGCAATAAAA 

4081 CGGAGTTTGC TGAAAAATCC GCCTTGCTGA ATAAAATTAA GGCCGGAGCA CAGTCAGGAC

4141 ATTACCGTCT GGTCTATTTT GAGTTCTGGG GGCGTTAAAT TACACGGATA ACACGCTGTT 

4201 TTACCAGACA ACGTCAGGCA GTATCACGCG AGATGACGTG ATTGATTTTT TAGAGCCGGT 

4261 GGCCAGACAA GGGACAACCG CCTGACATTT TTAGTGTTGG ATAATGCGCG TATCCATCAC 

4321 GGGATAGAGG AAAAAATCAG AAATGGCGGG TGACGAGAAC ACAACCTGTT TTTATTCTAT 

4381 CTTCCCGCTT ACAGCCCAGA GCTGTATCTG ATTGAAATCG TCTGGAAACA GGCCAAATAC 

4441 GACTGGCGAC GTTTTATCAC CTGGACTCAG GA TACAAT GG AATATGAGGT AAATACTTTA 

4501 TTGAAAGGTT ATGGCGACCA ATTTGCAATT AACTTTTCTT GAGTAC TTAG TAAGAATAGA 

4561 GTCAGTCGAG GTTTTTTCAT TTCGGGTCGT GGGGATGATA CTGAAAATTT GTTTGTAATC 

4621 TCTGAAAATT GCTGTTTCTG TGGCTACGTC TGTCTTTTGG GATATTGTTT CCATCAAGTC 

4681 TGTCAACATA CTGTTAAGTT AGATGTTGAT AAAAGAGACT GAATT ATAAT ACAAAACAAT 

4741 AAATCACTTG GACAATATTT TATTTCACAT GAGACATTAA GGTTGATTTT CCCAATCTGG 

4801 TCAGTTATAA CCGAATAAGG ATCTTGAAAA ATCATGGGAT CTTACTTTTA TCAAATGAAG 

4861 TTAACGTAAA AGTTGATAAA GAAAATTATT TAATTCTAAG TGCCGTTGGC ATAAATATTT 

4921 TGTGTTTTGT TAATGAATGA ATAACCAGGT AAGCTGGATT TTCATTTTTT AATTACTCGT 

4981 TACAATATGC TATTTATTTA TATAAAGAGT TTGTGCCCAT TTAACCAGTA AACAAATTTG 

5041 TTCAACCGTA ACTTAGCTTC ATCGACTTTT GGCCTCGCCT GGTCAGAATC TAGGGCCGTT 

5101 ATCCTATTTA TTTATGATAA ATAAAATTTA ATTATCTTTA ATAAGCTGAA TATGTGGATT 

5161 TGTGCTCAAT CTTGGATTCA AGTATGTATT CCTTTTGGTA CCCTGCTTTA TTTTAAGGCA 

5221 GATGAAGAGG ATGCCAACAT GACACAATAT CGATTACGAC TGTAACATTA AAGTCAGTTA 

5281 TAAATTTTAT GATTAAAATG AAATTTTAGT AGAAAATCGT ATTCTATTCC GCCATTTACA 

5341 ATAGCATCCT CTTTAATATC ATTAATCTCA GATAAAACAA ATAATTACAA TGTGAATAGA 

5401 ATAATGACTT ACAAAATAAG CACTAAATCT TCAGATGAAC TCTTAACTGA CAACACTATT 

5461 TTATAAAATA ATTGAGGTTA TTATGTATAG CACGGCTGTA TTACTCAATA AAATCAGTCC 

5521 CACTCGCGAC GGTCAGACGA TGACTCTTGC GGATCTGCAA TATTTATCCT TCAGTGAACT 

5581 GAGAAAAATC TTTGATGACC AGCTCAGTTG GGGAGAGGCT CGCCATCTCT ATCATGAAAC 

5641 TATAGAGCAG AAAAAAAATA ATCGCTTGCT GGAAGCGCGT ATTTTTACCC GTGCCAACCC 

5701 ACAATTATCC GGTGCTATCC GACTCGGTAT TGAACGAGAC AGCGTTTCAC GCAGTTATGA 

5761 TGAAATGTTT GGTGCCCGTT CTTCTTCCTT TGTGAAACCG GGTTCAGTGG CTTCCATGTT 

5821 TTCACCGGCT GGCTATCTCA CCGAATTGTA TCGTGAAGCG AAGGACTTAC ATTTTTCAAG 

5881 CTCTGCTTAT CATCTTGATA ATCGCCGTCC GGATCTGGCT GATCTGACTC TGAGCCAGAG 

5941 TAATATGGAT ACAGAAATTT CCACCCTGAC ACTGTCTAAC GAACTGTTGC TGGAGCTATT 

6001 ACCCGCAAGA CCGGAGGTGA TTCGGACGCA TTGATGGAGA GCCTGTCAAC TTACCGTCAG 

6061 GCCATTGATA CCCCTTACCA TCAGCCTTAC GAGACTATCC GTCAGGTCAT TATGACCCAT

6121 GACAGTACAC TGTCAGCGCT GTCCCGTAAT CCTGAGGTGA TGGGGCAGGC GGAAGGGGCT 

6181 TCATTACTGG CGATTCTGGC CAATATTTCT CCAGAACTGT ATAACATTTT GACCGAAGAG 

6241 ATTACGGAAA AGAACGCTGA TGCTTTATTT GCGCAAAACT TCAGTGAAAA TATCACGCCC 

6301 GAAAATTTCG CGTCACAATC ATGGATAGCC AAGTATTATG GTCTTGAACT TTCTGAGGTG 

6361 CAAAAATACC TCGGGATGTT GCAGAATGGC TATTCTGACA GCACCTCTGC TTATGTGGAT 

6421 AATATCTCAA CGGGTTTAGT GGTCAATAAT GAAAGTAAAC TCGAAGCTTA CAAAATAACA 

6481 CGTGTAAAAA CAGATGATTA TGATAAACAT GTAAATTACT TTGATCTGAT GTATGAAGGA 

6541 AATAATCAAT TCTTTATATG TGCTAATTTT AAGATATCGA GAGAATTTGG GGCGACTCTT 

6601 AGGAAAAACT CAGGGACAAG TGGCATTGTC GGCAGCCTTT CCGGTCCCCT GGTAGCCAAT 

6661 ACTAATTTCA AAAGCAATTA CTTAAGTAAC ATATCTGATA ATGAATACAG AAATGGCGTA 

6721 AAAATATATG CCTATCGCTA TACGTCTTCC ACCAGCGCCA CAAATCAGGG CGGCGGAATA 

6781 TTCACTTTTG AGTCTTATCC CCTGACTATA TTTGCGCTCA AACTGAATAA AGCCATTCGC 

6841 TTGTGCCTGA CTAGCGGGCT TTCACCGAAT GAACTGCAAA CTATCGTACG CAGTGACAAT 

6901 GCACAAGGCA TCATCAACGA CTCCGTTCTG ACCAAAGTTT TCTATACTCT GTTCTACAGT 

6961 CACCGTTATG CACTGAGCTT TGATGATGCA CAGGTACTGA ACGGATCGGT CATTAATCAA 

7021 TATGCCCGAC GATGACAGTG TCAGTCATTT TAACCGTCTC TTTAATACCC CGCCGCTGAA 

7081 AGGGAAAATC TTTGAAGCCG ACGGCAACAC GGTCAGCATT GATCCGGATG AAGAACAATC 

7141 TACCTTTGCC CGTTCAGCCC TGATGCGTGG TCTGGGGATC AACAGTGGTG AACTGTATCA 

7201 GTTAGGCAAA CTGGCGGGTG TATTGGACAC ACAAAATATC CTCACACTTT CTGTCCCTGT 

7261 TATATCTTCA CTGTATCGCC TCACGTTACT GGCCCGTGCC CATCAGCTGA CGGTTAATGA 

7321 ACTGTGTATG CTTTATGGTT TTTCGCCGTT CAATGGCAAA ACAACGGCTT CTTTGTCTTC 
7381 CGGGGAGTTG TCACGGCTGG TTATCTGGTT GTATCAGGTG ACGCAGTGGC TGACTGAGGG

7441 CGGAAATCAC CACTGAAGCG ATCTGGTTAT TATGTACGCC AGAGTTCAGC GGGAATATTT 

7501 CACCGGAAAT CAGTAATCTG CTTAATACTC TCCGACCCCG TATTAGTGAA GACATGGCAC 

7561 AAAGTAGTGA CCGGGAGCTT CAGGCTGAAA TTCTCGCGCC GTTTATTGCT GCAACGCTGC 

7621 ATCTGGCGTC ACCAGATATG GCGCGGTATA TCCTGTTGTG GACTGATAAC CTGCGGCCGG 

7681 GCGGCCTGAA TATCGCCGGA TTTATGATGC TGGTGCTGAA AGAGACGCTG AGTGATGAGG 

7741 AAACGACCCA ACTGGTTCAA TTCTGCCATG TAATGGCACA GTTATCGCTT TCCGTGCAGA 

7801 CACTGCGTCT CAGTGAAGCA GAGCTTTCTG TGCTGGTCAT TTCCGATTTT GTGGTACTGG 

7861 GTGCGAGAAG CCAACCGCCG GACAACACAA TATTGATACT CTGTTCTCAC TCTACCGATT 

7921 CCACCAGTGG ATTAATGGGC TGGGAAATCC CGGCTCTGAC ACGCTGGATA TGCTGCGCCA 

7981 AGCAGACACT CACGGGCGAC AGACTGGGCC TCCGTGATGG GGCTGGACAT CAGTATGGTA 

8041 ACGCAGGCCA TGGGTTCCCG CCGGCGTGAA CCAACTTCAG TGTTGGCAGG ATATCAACCC 

8101 CGTGTTGCAG TGGATACATG TGGCATCAGC ACTGCTCACT GATGCCGTCG GTTATCCGTA 

8161 CGCTGGTGAA TATCCGTTAC GTGACTGCAT TAAACAAAGC CGAGTCGAAT CTGCCTGCCT 

8221 GGGATAAGTG GCAGACGCTG GCAGAAAATA TGGCAGCCGG ACTGAGTACA CAACAGGCTC 

8281 AGACGCTGGC GGATTATACC GCAGAGCGCC TGAGTAACGT GTTGTGCAAT TGGTTTCTGG 

8341 CGAATATCCA GCCAGAAGGG GTGTCCCTGC ACAGCCGGGA TGACCTGTAC AGCTATTTCC 

8401 TGATTGATAA TCAGGTCTCT TCTGCCATAA AAACCACCCG ACTGGCAGAG GCCATTGCCG 

8461 GTATTCAGCT CTACATCAAC CGGGCGCTGA ACCGGATAGA GCCTAATGCC CGTGCCGATG 

8521 TGTCAACCCG CCAGTTTTTT ACCGACTGGA CGGTGAATAA CCGTTACAGC ACCTGGGGCG 

8581 GGGTGTCGCG GCTGGTTTAT TATCCGGAAA ATTACATTGA CCCGACCCAG CGTATCGGGC 

8641 AGACCCGGAT GATGGATGAA CTGCTGGAAG ATATCAGCCA GAGTCAGCTC AGCCGGGACA 

8701 CGGTGGAAGA GGCCTTTAAA ACTTACCTGA CCGCTTTGAA ACCGTGGCAG ACCTGAAAGT

8761 TGTCAGCGCT ATCACCGACA ACGTCAACAG CAACACCGGA CTGACCTGGT TTGTCGGCCA 

8821 AACGCGGGAG AACCTGCCGG AATATTACTG GCGTAACGTG CATATATCAC GGATGCAGGC 

8881 GGGTGAACTG GCCGCCGATG CCTGGAAAGA TTGGACGAAG ATTGATACAG CGGTCAACCC 

8941 ATACAAGGAT GCAATACGTC CGGTCATATT CAGGGAACGT TTGCACCTTA TCGTGGGTAG 

9001 AAAAAGAGGA AGTGGCGAAA AATGGTACTG ATCCGGTGGA AACCTATGAC CGTTTTACTC 

9061 TGAAACTGGC GTTTCTGCGT CATGATGGCA GTTGGAGTGC CCCCTGGTCT TACGATATCA 

9121 CAACGCAGGT GGAGGCGGTC ACTGACAAAA AACCTGACAC TGAACGGCTG GCGCTGGCCG 

9181 CATCAGGCTT TCAGGGCGAG GATACTCTGC TGGTGTTTGT GTACAAAACC GGGGTGAGTT 

9241 ACCCGGATTT TGGCGACAAC AATAAAAATG TGGCAGGCAT GACCATTTAC GGCGATGGCT 

9301 CCTTCAAAAA GATGGAGAAC ACAGCACTCA GCGTTACAGC CAACTGAAAA ATACCTTTGA 

9361 TATCATTCAT ACTCAAGGCA ACGACTTGGT AAGAAAGGCC AGCTATCGTT TCGCGCAGGA 

9421 TTTTGAAGTG CCTGCCTCGT TGAATATGGG TTCTGCCATC GGTGATGATA GTCTGACGGT 

9481 GATGGAAAAC GGGAATATTC CGCAGATAAC CAGTAAATAC TCCAGCGATA ACCTTGCTAT 

9541 TACGCTACAT AACGCCGCTT TCACTGTCAG ATATGATGGC AGTGGCAATG TCATCAGAAA 

9601 CAAACAAATC AGCGCCATGA AACTGACGGG GTTGGATGAA AGTCCCAGTA CGGCAATGCA 

9661 TTTATCATCG CAAATACCGT TAAACATTAT GGCGGTTACT CTGATCTGGG GGGCCCGATC 

9721 ACCGTTTTTA TTAAAACGGA AAAACTATAT TGCATCAGTT CAAGGCCACT TGATGAACGC 

9781 AGATTACACT AGGCGTTTGA TTCTAACACC AGTTGAAAAT AATTATTATG CCAGATTGTT

9841 CGAGTTTCCA TTTTCTCCAA ACACAATTTT AAACACCGTT TTCACGGTTG GTAGCAATAA 

9901 AACCAGTGAT TTTAAAAAGT GCAGTTATGC TGTTGATGGT AATAATTCTC AGGGCTTCCA 

9961 GATATTTAGT TCCTATCAAT CATCCGGCTG GCTGGATATT GACACAGGTA TTAACAATAC 

10021 TGATGTCAAA ATTACGGTGG TAGCTGGCAG TAAAACCCAC ACCTTTACGG CCAGTGACCA 

10081 TATTGCTTCC TTGCCGGCAA ACAGTTTTGA TGCTATGCCG TACACCTTTA AGCCACTGGA 

10141 AATCGATGCT TCATCGTTGG CCTTTACCAA TAATATTGCT CCTCTGGATA TCGTTTTTGA 

10201 GACCAAAGCC AAAGACGGGC GAGTGCTGGG TAAGATCAAG CAAACATTAT CGGTGAAACG 

10261 GGTAAATTAT AATCCGGAAG ATATTCTGTT TCTGCGTGAA ACTCATTCGG GTGCCCAATA 

10321 TATGCAGCTC GGGGTGTATC GTATTCGTCT TAATACCCTG CTGGCTTCTC AACTGGTATC 

10381 CAGAGCAAAC ACGGGCATTG ATACTATCCT GACAATGGAA ACCCAGCGGT TACCGGAACC 

10441 TCCGTTGGGA GAAGGCTTCT TTGCCAACTT TGTTCTGCCT AAATATGACC CTGCTGAACA 

10501 TGGCGATGAG CGGTGGTTTA AAATCCATAT CGGGAATGTT GGCGGTAACA CGGGAAGGCA 

10561 GCCTTATTAC AGCGGAATGT TATCCGATAC GTCGGAAACC AGTATGACAC TGTTTGTCCC 

10621 TTATGCCGAA GGGTATTACA TGCATGAAGG TGTCAGATTG GGGGTTGGAT ACCAGAAAAT 

10681 TACCTATGAC AACACTTGGG AATCTGCTTT CTTTTATTTT GATGAGACAA AACAGCAATT 

10741 TG7ATTAATT AACGATGCTG ATCATGATTC AGGAATGACG CAACAGGGGA TCGTGAAAAA 

10801 TA7CAAGAAA TACAAAGGAT TTTTGAATGT TTCTATCGCA ACGGGCTATT CCGCCCCGAT 

10861 GGATTTCAAT AGTGCCAGCG CCCTCTATTA CTGGGAATGT TCTATTACAC CCCGATGATG 

10921 TGCTTCCAGC GTTTGCTACA GGAAAAACAA TTCGACGAAG CCACACAATG GATAAACTAC 

10981 GTCTATAATC CCGCCGGCTA TATCGTTAAC GGAGAAATCG CCCCCTGGAT CTGGAACTGC 

11041 CGGCCGCTGG AAGAGACACT CCTGGAATGC CAATCCGTTG GATGCCATTG ATCCGGATGC 

11101 CGTCGCACAA TATGACCCGA CACACTATAA AGTTGCCACC TTTATGCGCC TGTTGGATCA 

11161 ACTTATTCTG CGCGGCGATA TGGCCTATCG CGAACTGACC CGCGATGCGT TGAATGAAGC 
CAAGATGTGG
CCAACAGTGG 
TCAACAAGAC 
CTCGTTGGTG 
TGCGTTTGCG 
GGCGAATTAC 
AGGGCGGTAG 
AGCGGGCCCG 
CAGAGCATGA 
CGACACAGAG 
TATTGGCAGA 
ACGAGGATAT 
AGTCTCTGGC 
TGTTCGGTTT 
TGATGTCGCT 
CCTACCGCCG 
AACAAATGGA 
TGGAATATCA 
AATTCACAAA 
AGTTCTTTGA 
TGACCGACAA 
TGATGGCGGG 
GTGATGAGCG 
TATCATCAGA 
GCAACGTAGG 
TGCGATTGTC 
AG7TGAAACA 
CGGTGCTGAA 
CCCACGGCGT 
CGTTTGAAGG 
CTGATCGACA 
CCATTCGTTC 
GTTTTTATGC 
GGATCACTAA 
TTTCACTGCC 
ACAGCAGTAC 
TCAGCCTGCG 
CGGATGGGGA 
CAACCTCACT 
GCGTGG CAGA 
AGACGTCTTT 
ATCATGCACG 
AAACCGTCAC 
ACTGTGATGA 
AGTCCACTAT 
TGTTGATAAT 
GAACTCCGTA 
AAAATGGCGT 
CCGTCGCTTG 
GGTTGCAGAA 
CAAGGTTTCC 
GATGATGTCC 
GCAGTCCATG 
ATATGGAGAA 
GTGCTCCGGT 
AACCACTGCC 
ACGGGCGTCT 
CGGAAGGTGA 
CGCAGGCAAA 
CAAATAGTGT 
TTCATTTGTC 
TCAGTGATAT 
GCTACTGGCC 
AAATTACGGG 



TATGTGCGTG 
GCCGCACCGT 
CTTACGGCGC 
GTTTGGTCCT 
CCTGGTTAAC 
GCGAGCCTAC 
TGCAGTGCTG 
CAATCTGGTA 
TGATGCCGAT 
CATCCGTATT 
GAGCCGCCGC 
CAACCACGGA 
CGGGCAGGCG 
CGCTTGTGGC 
TTCTGCCACA 
CCGCCGTCAG 
TGCCCAGCTG 
GGAGACCCAG 
CAAAGCGCTT 
CCTGACCCAG 
CGGTGTTACC 
TGAAACGTTG 
GGCACTGGAA 
CAACTTTAAT 
AGCTTCCGGC 
TGATTTGAAA 
AGTGAGTGTC 
TTACGGCGGC 
GAATGACAGT 
TATTTCCGTG 
GAAAGCGCTG 
TTAATTAAAA 
AGGGTTCAAC 
AAGGAATGGG 
CTTGCCGATC 
TGCTGGCAAT 
TACCGCCAAG 
AGTGTTGAGT 
GTTGGGGACG 
AAAAATCGTT 
TTGGGTACTT 
TATTGCTGAC 
GCATACCGGG 
GCATGAACTT 
GGCAATACTC 
GACTGGTTGT 
CCCGAATTCA 
TGTCGTCCGG 
TGTCGCCAAG 
GAAACACCGG 
TTGCTGCAAA 
CCGCTGGAAA 
CCGCAGTTAG 
GGAATTTCCG 
ACGGGATATC 
ACATATTCCG 
GGATTGGGTG 
ATGGACACCC 
ACTGGCTGAT 
ACGTGTCTGG 
AAATAAGCCA 
GACAGGCTCC 
GAACCTGGGG 
GAAACGTTTA 
CTTTGGAATT
CTCTTTCCGT 
TAGACAACGG 
GCCGGAATAT 
CTGCGCCATA 
GATCCGAAAG 
CCCGGCACAT 
GCGCAATTAA 
GAACTCACCA 
CAGCAACGAA 
AGTGCACAAA 
GAACAGCGTG 
CTCTCAGTAG 
GGCAGTCGTT 
GCTTCCCAAT 
GAGTGGGAAA 
GAAAGCCTGA 
CAGGCCCATA 
TACAGTTGGA 
TCCTTCTGCC 
TTTATCCGGG 
CTGCTGAATC 
GTGACCCGTA 
CTGACCGAAA 
AATGAATTAA 
ATTTTCAGCG 
ACCTTGCCGG 
AGCATCGTCA 
GGTCAATTTA 
AATGACAGCG 
CTGGAGAGCC 
CATTGTGATA 
ACCTTTGAAA 
AGAAGCACTC 
TCTGTCCGGC 
GGGTCATTCG 
GGCGTTCCGC 
ATTGTGCCGG 
GTTCTGACAC 
CGTTTAGAAC 
TTTACTGCGG 
CCGCAGGATG 
GAACATATTT 
GCTCAGCATT 
AGCCGGAAAC 
TTCATCTGGT 
ATGTGTCAGA 
ACAGTTTCTC 
TTCTGATGTT 
CGCTGGTTTC 
CGGCCCGCAG 
TGGATTATCA 
AAAAAATGAA 
GCGTTACTTT 
ACTGCCGAAG 
GCACAACAGG 
ATTACGGCAT 
TTTATTCCAT 
ATTGATGGGG 
TCAAATAATC 
CTGCCGGTTC 
GGGCAATCAC 
CATGGAAAAT 
ACCCCCACAG 



GCTGGGTGAT 
GGCGGGCAAC 
AGAAGGTTGC 
AACCCGGAAT 
ATCCTTCCAT 
CGCTGCTCAC 
TGTCGTTATA 
CCCAGTTCGG 
CGTTGCTACT 
CTGTCGATGA 
ATCGTCTGGA 
CGATGTCACT 
CAGAAGGGGT 
GGGGGGCAGC 
ATTCCGCAGA 
TTCAGCGTGA 
AAATACGCGG 
CTCAGGCTCA 
TGCGCGGCAA 
TGATGGCACA 
GTGGGGCCTG 
TGGCAGAAAT 
CCGTCTCGTT 
AACTCACGCA 
AACTCAGTAA 
ATACCCCGGA 
CGCTGGTTGG 
TGCCACGCGG 
TGCTGGATTT 
GTAGCCTGAC 
TGAGCGATAT 
GGCAGGCTCC 
CTTGAAATAC 
AATGCCGTCG 
GTGGTCTGGT 
GGATGGGGTG 
ACTATACGGG 
ACAGCCAAGG 
AGCCGCCTAC 
ACTGGCAGCC 
ATGGTTTAGT 
AAACCAGAAT 
ACTATCACTA 
CAGGTGTTAC 
CGCTTTTTTC 
ATTTGATTAC 
AAACAATGTG 
CCGCTATGAA 
TCATCAGCTG 
CCGTCTTATT 
ACTGGCCCAT 
ACGTGTTAAT 
CACGTTGCAG 
ATCAGGATAC 
GAACGAATGC 
AAAGCGCGAT 
CAGGGTTACG 
TATCCGCTGT 
CTGGGCTGCC 
CGGCAGGATG 
CCGGCAAAAA 
ATCTGGTGGA 
TTGGTGAGCC 
ACTGTATATG 



GAGCCGGAGG 
CACACTGTGC 
ACTCAACCCC 
CAACCGATTA 
GACGGGCAAC 
CAGTATGGTA 
CCGCTTCCCG 
CACCTCTCTG 
ACAGCAGGGT 
AGTGGATGCT 
AAAATACCAG 
GTTTGATGCG 
GGCTGACTTA 
ACTGCGTGCT 
CAAAATCAGC 
TAATGCTGAC 
CGAAGCAGCA 
GTTAGAGCTG 
GCTGAGTGCT 
GGAAGCGCTG 
GAACGGTACG 
GGAAAAAGTC 
GGCACAGTTC 
ATTCCTGCGT 
CCG CCAGA TA 
AAGCTTTGGC 
TCCGTATGAA 
TTGCAGTGCT 
CAACG ATTC C 
GTTGAGTTTC 
CATTCTGCAT 
TGAGGGAGCC 
CGTCATTGCC 
GAGCGGAAGG 
GCCGGTGCTA 
GCAATGTGGG 
ACAAGATGAG 
GCAACCAGAG 
TGTTACCCGC 
ACAGCAGAGA 
GCACCTATTC 
TGCCCGCTGG 
TCGGGCAGAA 
GGCCCACCGT 
GCGGTAAAAT 
GGTGAGCGCT 
TCTGAAAACA 
TATGGGTTTG 
AAAGCGCTGG 
CTGGATTATG 
GAAACGGACG 
CATGGCGTGA 
CCATACCAAT 
TCAGAAAGCC 
GGTTACCTAT 
GTTGTTGGAC 
GGGCTACCAC 
GCCAATGGAA 
TGACTTAGCG 
GGATCGCGCT 
TAAGCGTCAT 
AGTTACGGCA 
TCTGATGATA 
GTAGACCTAA 



ATTACGGCAG 
AAGCGGGCTA 
GCAACGCTAA 
CTGG CAAACC 
CGTTATCGCT 
CAGCCTTCTC 
GTGATGCTGG 
CTCAGTATGG 
ATGGAACTGG 
GATATTGCTG 
CAGCTGTATG 
GCGGCAGGTC 
GTTCCAAACG 
TCCGCCTCCG 
CGTTCGGAAG 
GGTGAAGTCA 
CAGATGCAGG 
TTACAGCGTA 
ATCTATTACC 
CGCCGCGAGC 
ACTGCGGGTT 
TGGCTGGAGC 
TATCAGGCCT 
GAAGGGAAAG 
GAAGCCTCAG 
AATACCCGTC 
GATATCCGGG 
ATTGCTCTCT 
CGTTATCTGC 
CCGGATGCGA 
ATCCGCTATA 
TGTTTAAGGA 
CTCTGGGGGC 
GGAGCGTCAT 
TCACTGAATT 
GTTGGTTTTA 
TATCTCGGGC 
CAACGCACCG 
TATCAGTCCC 
CGTGAGGAAG 
GGTAAGCATC 
CTGATGGAGG 
GACGATCTTG 
TATCCTGGCA 
CAGGTATCCC 
TATCTTCGCT 
ATGTGTCTGA 
AAATTCGAAC 
CAGGGGAAAA 
ACCTGAACAA 
GTACGCCAGT 
ATCTGAACTG 
15061 CACCACCCGA TTTTATTTAT GCCCGCAATA CTTACCTTGA ACTCTATGCC AATGAAAGCG

15121 GCAATCATTC TGCTGAACCT CAGCGTATTG ATCTGCCGGA TGGGGTACGT TTTGATGATA 

15181 CTTGTCGGTT ACAAATAGCG GATACACAAG GATTAGGGAC TGCCAGCATT ATTTTGACGA 

15241 TCCCCCATAT GAAGGTGCAG CACTGGCGAT TGGATATGAC CATATTCAAG CCTTGGCTGC 

15301 TGAATGCCGT CAATAACAAT ATGGGAACAG AAACCACGCT GTATTATCGC AGCTCTGCCC 

15361 AGTTCTGGCT GGATGAGAAA TTACAGGCTT CTGAATCCGG GATGACGGTG GTCAGCTACT

15421 TACCGTTCCC GGTGCATGTG TTGTGGCGCA CGGAAGTGCT GGATGAAATT TCCGGTAACC 

15481 GATTGACCAG CCATTATCAT TACTCACATG GTGCCTGGGA TGGTCTGGAA CGGGAGTTTC 

15541 GTGGTTTTGG GCGGGTGACG CAAACTGATA TTGATTCACG GGCGAGTGCG ACACAGGGGA 

15601 CACATGCTGA ACCACCGGCA CCTTCGCGCA CGGTTAATTG GTACGGCACT GGCGTACGGG 

15661 AAGTCGATAT TCTTCTGCCC ACGGAATATT GGCAGGGGGA TCAACAGGCA TTTCCCCATT 

15721 TTACCCCACG CTTTACCCGT TATGACGAAA AATCCGGTGG TGATATGACG GTCACGCCGA 

15781 GCGAACAGGA AGAATACTGG TTACATCGAG CCTTAAAAGG ACAACGTTTA CGCAGTGAGC

15841 TGTATGGGGA TGATGATTCT ATACTGGCCG GTACGCCTTA TTCAGTGGAT GAATCCCGCA 

15901 CCCAAGTACG TTTGTTACCG GTGATGGTAT CGGACGTGCC TGCGGTACTG GTTTCGGTGG 

15961 CCGAATCCCG CCAATACCGA TATGAAGGGG TTGTTACCGA TTCCACAGTG CAGCCAAAAG

16021 ATTGTCCTTA AATATGATGC GTTAGGATTT CCGCAGGACA ATCTTGAGAT TGCCTATTCG 

16081 AGACGTCCAC AGCCTGAGTT CTCGCCTTAT CCGGATACCC TGCCCGAAAC A CTTTT CACC 

16141 AGCAGTTTCG ACGAACAGCA GATGTTCCTT CGTCTGACAC GCCAGCGTTT TTCTTATCAC 

16201 CATCTGAATC ATGATGATAA TACGTGGATC ACAGGGCTTA TGGATACCTC ACGCAGTGAC 

16261 GCACGTATTT ATCAAGCCGA TAAAGTGCCG GACGGTGGAT TTTCCCTTGA ATGGTTTTCT

16321 GCCACAGGTG CAGGAGCATT GTTGTTGCCT GATGCCGCAG CCGATTATCT GGGACATCAG 

16381 CGTGTAGCAT ATACCGGTCC AGAAGAGCAA CCCGCTATTC CTCCGCTGGT GGCATACATT 

16441 GAAACCGCAG AGTTTGATGA ACGATCGTTG GCGG CTTTTG AGGAGGTGAT GGATGAGCAG 

16501 GAGCTGACAA AACAGCTGAA TGATGCGGGC TGGAATACGG CAAAAGTGCC GTTCAGTGAA 

16561 AAGACAGATT TCCATGTCTG GGTGGGACAA AAGGAATTTA CAGAATATGC CGGTGCAGAC 

16621 GGATTCTATC GGCCATTGGT GCAACGGGAA ACCAAGCTTA CAGGTCAAAC GACAGTGACG 

16681 TGGGATAGCC ATTACTGTGT TATCACCGCA ACAGAGGATG CGGCTGGCCT GCGTATGCAA

16741 GCGCATTACG ATTATCGATT TATGGTTGCG GATAACACCA CAGATATCAA TGATAACTAT 

16801 CACACCGTGA CGTTTGATGC ACTGGGGACG GTAACCAGCT TCCGTTTCTG GGGGACTGAA 

16861 AACGGTGAAA AACAAGGATA TACCCCTGCG GAAAATGAAA CTGTCCCCTT TATTGTCCCC 

16921 ACAACGGTGG ATGATGCTCT GGCATTGAAA CCCGGZATAC CTGTTGCAGG GCTGATGGTT 

16981 7ATGCCCCTC TGAGCTGGAT GGTTCAGGCC AGr7777C7A ATGATGGGGA GCTTTATGGA 

17041 GAGCTGAAAC CGGCTGGGAT CATCACTGAA GA7GG77ATC TCCTGTCGCT 7GC7T7TCGC 

17101 CGCTGGCATC AAAATAACCC TGCCGCTGCC A7GCCAAAGC AAG7CAATTC ACAGAACCCA 

17161 CCCCATGTAC TGAGTG7GAT CACCGACCGC 7A7GA7GCCG ATCCGGAACA ACAATTACGT 

17221 CAAACGTT7A CGT7TAGTGA TGGTTTTGGG CGAAACCT7A CAAACAGCCG TACGCCATGA 

17281 AAGTGGTGAA GCCTGGGTAC CTGATGAGTA 7GGAGCCAA7 GTGGCTGAAA ATCAAGGCGC 

17341 CCCTGAAACG GGCGATTACA AATTTCCCGT TGGGCAA7TT CCCGGACGTA CAGAATATTA

17401 ACGGGAAAAG GCAAAGCCCC 7GCGT7ACG7 77CAAACCGT ATTCC7GAAA TAATTTGGGC 

17461 AACTATGTCA AGTTGACCAA AAAATGCCCG GCAGG A7A7G TATGCCGATA CCCATTACTA 

17521 TGATCCG7TG GGGCGTGAAT A7CAGGTTAT CACGCCAAAG GCGGGTTGCG TCGATCCTTA 

17581 TTCACTCCC7 GGT7TGTGGT GAATGAAGTT GAAAA7GACA CTCCCGGTGA ATGACAGCAT 

17641 AAAGCTCAG7 GATGCCTGT7 CACTGAACAG ACA7CACTCC ATTTAGGAA7 GAA7CATGAA 

17701 GAATT7CGTT CACAGCAATA CGCCATCCGT CACCGTACTG GACAACCG7G G7CAGACAGT 

17761 ACGCGAAA7A GCC7GGTATC GGCACCCCGA TACACC7CAG G7AACCGA7G AACGCATCAC 

17821 CGGTTATCAA TATGATGCTC AAGGATC7CT GAC7CAGAG7 ATTGATCCGC GATTTTATGA 

17881 ACGCCAGCAG ACAGCGAGTG ACAAGAACGC CA7TACACCC AATCTTATTC TCTTGTCA7C

17941 ACTCAGTAAG AAGGCATTGC GTACGCAAAG TG7GGATGCC GGAACCCGTG TCGCCCTGCA 

18001 TGATGTTGCC GGGCGTCCCG TTTTAGCTGT CAGCGCCAA7 GGCGTTAGCC GAACGTTTCA 

18061 GTATGAAAG7 GATAACCTTC CGGGACGA7T GC7AACGATT ACCGAGCAGG TAAAAGGAGA 

18121 GAACGCCTGT ATCACGGAGC GATTGAT77G G7CAGGAAA7 ACGCCGGCAG AAAAAGGCAA 

18181 7AA77TGGCC GGCCAGTGCG 7GGTCCAT7A TGA7CCCACC GGAATGAATC AAACCAACAG 

18241 CATATTGTTA ACCAGCATAC CCTTGTCCAT CACACAGCAA TTAGTGAAAG ATGACAGCGA 

18301 AGCCGATTGG CACGGTATGG ATGAA7TTGG CTGGAAAAAC GCGCTGGCGC CGGAAAGCTT 

18361 CAC7TCTGTC AGCACAACGG ATGCTACCGG CACGG7ATTA ACGAGTACAG ATGCTGCCGG 

18421 AAACAAGCAA CGTATCGCCT ATGATG7GGC CGG7C7GC7T CAAGGCAGTT GGTTGGCGCT 

18481 GAAGGGGAAA CAAGAACAAG 7TATCGTGAA A7CCC7GACC 7ATTCGGCTG CCAGCCAGAA 

18541 GCTACGGGAG GAACATGGTA ACGGGATAGT GAC7ACATAT ACCTATGAAC CCGAGACGCA 

18601 ACGAGTTATT GG CAT AAAAA CAGAACG TCC TTCCGGTCAT GCCGCTGGGG AGAAAATTTT 

18661 ACAAAACCTG CG TTATG AAT ATGATCCTGT CGGAAATGTG CTGAAA7CAA C7AATGATGC 

18721 TGAAATTACC CGCTTTTGGC GCAACCAGAA AATTG7ACCG GAAAATACTT ACACCTATGA 

18781 CAGCCTG7AC CAGCTGGTTT CCGTCACTGG GCGTGAAATG GCGAATATTG GCCGACAAAA 

18841 AAACCAGTTA CCCATCCCCG CTCTGATTGA 7AACAATAC7 TATACGAATT ACTCTGGCAC
18901 TTACGACTAT GATCGTGGGG GAATCTGACC AGAATCGCAT AATTCACGAT CACCGGTAAT

18961 AACTATACAA CGAACATGAC CGTTTCAGAT CACAGCAACC GGGCTGTACT GGAAGAGCTG 

19021 GCGCAAGATC CCACTCAGGT GGATATGTTG TTCACCCCCG GCGGGCATCA GACCCGGCTT 

19081 GTTCCCGGTC AGGATCTTTT CTGGACACCC CGTGACGAAT TGCAACAAGT GATATTGGTC 

19141 AATAGGGAAA ATACGACGCC TGATCAGGAA TTCTACCGTT ATGATGCAGA CAGTCAGCGT 

19201 GTCATTAAGA CTCATATTCA GAAGACAGGT AACAGTGAGC AAATACAGCG AACATTATAT 

19261 TTGCCAGAGC TGGAATGGCG CACGACATAT AGCGGCAATA CATTAAAAGA GTTTTTGCAG 

19321 GTCATCACTG TCGGTGAAGC GGGTCAGGCA CAAGTGCGGG TGCTGCATTG GGAAACAGGC 

19381 AAACCGGCGG ATATCAGCAA TGATCAGCTG CGCTACAGTT ATGGCAACCT GATTGGCAGT

19441 AGCGGGCTGG AATTGGGACA GTGACGGGCA GATCATTAGT CAGGAAGAAT ATTACCCCTA 

19501 TGGGGGAACC GCCGTGTGGG CACCCGAAAT CAGTCAGAAG CTGATTACAC AAGCCGGCGT 

19561 TATTCTGGCA AAGAGCGGGA TGCAACAGGG TTGTATTACT ACGGCTATCG TTATTATCAA 

19621 TCGTGGACAG GGCGATGGTT GAGTGTAGAT CCTG CCGGTG AGGCCGATGG TCTCAATTTG 

19681 TTCCGAATGT GCAGGAATAA CCCCATCGTT TTTTCTGATT CTGATGGTCG TTTCCCCGGT 

19741 CAGGGTGTCC TTGCCTGGAT AGGGAAAAAA GCGTATCGAA AGGCAGTCAA CATCACGACA 

19801 GAACACCTGC TTGAACAAGG CGCTTCCTTT GATACGTTCT TGAAATTAAA CCGAGGATTG 

19861 CGAACGTTTG TTTTGGGTGT GGGGGTACAA GTCTGGGGGT GAAGCGGCCA CGATTGCAGG 

19921 AGCGTCGCCT TGGGGGATCG TCGGGGCTGC CATTGGTGGT TTTGTCTCCG GGGCGGTGAT 

19981 GGGG TTTTT C GCGAACAACA TCTCAGAAAA AATTGGGGAA GTTTTAAGTT ATCTGACGCG 

20041 TAAACGTTCT GCTCCTGTTC AGGTAGGCGC TTTTGTTGTC ACATCGCTTG TGACGTCTGC 

20101 AC7ATTTAAC AGCTCTTCGA CAGGTACCGC CATTTCCGCA GCAACAGCGG TCACCGTTGG

20161 AGGATTAATG GCTTTAGCCG GAGAACATAA CACGGGCATG GCTATCAGTA TTGCCACACC

20221 CGCCGGACAA AGTACGCTGG ATACGCTCAG GCCCGGTAAT GTCAGCGCGC CAGAGCGGTT

20281 AGGGCACTAT CAGGCGCAAT TATTGGCGGC ATATTACTTG GCCGCCATCA GGGAAGTTCT

20341 GAGCTGGGTG AACGGGCAGC GATTGGTGCT ATGTATGGTG CTCGATGGGG AAGGATCATT

20401 GGTAATCTAT GGGATGGCCC TTATCGGTTT ATCGGCAGGT TACTGCTCAG AAGAGGCATT

20461 AGCTCTGCCA TTTCCCACGC TGTCAGTTCC AGGAGCTGGT TTGGCCGAAT GATAGGAGAA 

20521 AGTGTCGGGA GAAATATTTC TGAAGTATTA 7TACCTTATA GCCGTACACC CGGTGAATGG 

20581 GTTGGTGCAG CCATTGGCGG GACAGCCGCG GCCGCTCATC ATG CCGTT GG AGGGGAAGTT 

20641 GCCAATGCCG CTAGCCGGGT TACCTGGAGC GGCTTTAAGC GGGCTTTTAA TAACTTCTTC 

20701 T77AACGCCT CTGCACGTCA TAATGAATCC GAAGCA7AAC AATCATGTTC ATTCCCACTT 

20761 TG7CATGGAT GACAAGGTGG GTTTTTCGGA TGTGTGGACA GAGACCCGTA CAGGGTCTCT 

20821 GTCC^GTTAA TTTTTGGATC AAGAACGAA7 GGTGTAACGG ATATGCAAAA TGATATCGCT

20881 CAGGCTGAGC AATAAGCTTT TCTGTTTACC AC7GA7ACCG GGAAAACTGA GGGTTAATGT

20941 GCCTGTATCG GCCACAGGAA GCCCTTCAAA 7GGCAGGTAC TTAGCATCAT TGAAATCCAT

21001 CTGGAATTGA CCACTGTCAT TCATGCCA7G 7GAGATCACA ATCGCTTTGC AGCCACG7GG 

21061 CATCATTGTA CTGCCGCCAT AACTCAG7A7 7GCCCGGACA TCCTGATAAG GCCC7AAAAG 

21121 GGCAGGTAAC GTCACACTGA T7TG7TTGA7 ACGGCG7G7A TTACCTAAAC CGTCAGGA7A 

21181 A7CGGTAGCA ATATTCAGAT CCGATAATT7 GAGGCTGGCT TGCAGTTGTG TCCCTTCGAC 

21241 G77CAAACCG 7TAAGCGTTG TGCCTGCAC7 GCCTTCACC7 GCA7TGACTA ACTCAG7CAC 

21301 TTTATCTTTT AAAATGAAAC TATTITCTGT CAGACCAGCA TACACTTCAG CCAGAGAAAC
21361 GG7TCTGGTG ACCTCCAGTG CCCGTTCA7C 7TT77CCAAA TAGCTTTTTT CCATCTGTGC
21421 TAAATTCAGC ATCAGGGTTT CACCCGCTAA 7AAACCCGCA TAAGTCCCAT GCCAAGCACC 

21481 TGGTTTAATA AAGTGTGCTG CCGCATTAT7 CAA77CA7AC TGATAAGTTT GCTCTGCCA7
21541 TAAACAGAGT GAGACCGCCA AATCATAAAA CTGATAA7AA ATAGCGGACA ACGTTCCACG 
21601 GAGCCAGTTG TATAGCGCTG CATTACTGAA TTTAC7TTGC AGAAAGGCTA ACTGCGCCTG 
21661 AGTTTGTGCC TGCTGAGTTT CCAGATAG77 7T7T7GTAA7 ACTGCCGCTT CACGACGTAC 
21721 AGCCAGCGTC GCTAATTGAG CATCAATTTG 7T77A7C7CA GCTTCCGCA7 TATTGCGCTG 
21781 AATTTCCCAC TCTTGCCGAC GGCGACGGTA TA7TTCTGAT TGGCTGATTT TGTCTGCGGC 
21841 AATACGTGTT GCTGACGCAG AAATTTCGA7 ACCAATCGCA CTGGCATTGA AAAGCGCCCC 
21901 AAAACGGGAA CCTCCCACAG CAAAACCGTA AATA7TGGGG ACGAGATCTG CCGCGGCGGC 
21961 GGCCATATGC AGGGCTGTGC CGCTGGTGCT CAAGACCGA7 GAAGAGAGGT AAAGATCCA7 
22021 CGCTTGTTTT TCACCAGCGT TAACATCTTC GTCG7ACAGC G7ATTGAAAC TGTCAAAACG 
22081 AGACTGTGCA CCATGACGGC TTTCTTGAAG CGCCAATTTA TCAGCATCAA TTTCAGCCA7 
22141 GACCTTATCC TGCATTTTAA TACTTTGCAG GGCTAACTCA CTGCCTTGAG TTTGCAGTA7 
22201 TTCAGCCAAG GCTTCTGCAT CCTGCCGTTC AGTAATGCTG AGCAGGGTAT TGCCAAATTG 
22261 TATCAACTGG CTTACCCCCC ACTTGGCAT7 7TCCAGAATC ACCGGAAAAC GGTACATCGG 
22321 CATCACTGCA TGAGGTAAA7 CGCCGCCGCC TTGTGAAGCA G7GATGGCAG CACTGAGTAA
22381 CA7GGACGGA TCTGCGGGCG 7GGCATAGAG AGATAATGAC AGTGGCTGAC CGTCGATTG7 
22441 CAGGTTATGG CGTAAGTTAT AGAGGCGTTG CGTCAATG7C TGCCAGTAAC CTTGCAGTTT 
22501 T77ATTAATT TGAGGGAGGA ACAATGCGG7 TAACGAAA7T TGCCGTACGT TTCGTGGGTA 
22561 ATGCAGCGCG CTGACGCAGT TGCAGCAT7T TA7G77GA7A ATGATGCCGC AT7GTTTGGC 
22621 7GGCAGCTTC TTCCAGCCGT GGCTC7GACC AATCGTTA7C CAATGAAAAA TAAGGCTCA7 
22681 CA^CCAATAA AGTGAGCGCC 7G7ACA7ACC ACA77T7AGC 7TCGTTTAAG GTA7CACG77






WO 98/08388 



PCT/GB97/02284 



Fig.2. 






22741 CAAGCTGGCG ATAGGCGCTA TCTCCGCGGG TAATCAACAA ATCCAGCATT TTCATAAAGG 
22801 TAGCCACTTT ATAGTGCATC GGATCATGCT GGGCAACGGC GTCCGGATCG ACCGAATCCA 
22861 GCGGATTGGC ATTCCAGGAC GTATCTTCCT CCAATGGGCG GACGTTCCAG TAATAATCCT 
22921 GCATTTCACC CTGAACCGAA TATCCGGTCG GGTTCAGATA TAGCGCAGCC AGCGTGTCGA 
22981 TCCGGTAAAA TCTGCTCTTG CAATAAGCGC TGGAATACCA TCATGGGCGT TCTAATAGAA 
23041 CAATCCCAAG AAATAGATTG CATTGGCGCC GTTTGAAATC CATGGGTTCA GTGTTATTTT 
23101 TCATGACACG ACTTGAATAC CCCTTTTATA TTTTTTGATA TTTTTTACTA TCCCCTGTTG 
23161 TGTCATTCCC GAATCATGAT CGGCATCATT AGTGAATATA AATTGATTTT TCGTCTCATC 
23221 AAAATAAAAG AAAGCAGATT CCCAGGATTT GTCATAGATA ATTTTTTTGT ACCCAACCCC 
23281 TAATCTGACA CCITCACGTA TGTAATATCC TTTAGCATAG GGAACAAAGA GCGTTACTGT 
23341 GGTTTCAATA TCAGATAACA TTCCTTCGTA ATAAGGTTGT CTGGCAGAAT TGCCATCAAT 

23401 ATTCCCAATA TGGATCTTAA ACCAACGTTC ATCACCATGC TCCTCTTTAT TGTAGGGGGG 
23461 CAACTTAAAT GTCGCATAAA ACCCTTCACC TAATTGCGGC TCTGGTAAAT TTTGCGTTTC 
23521 CATACTTAAA ACATTATCAA TACCAATATT GGCTCTTTCA GCTAATTTTC TGGAAAATAA 

23581 AGTATTTAAC CGGGTTCTGT AAGGGCCAAT CTGCATATAT TGTGTGCCTG ATGGCATTTT 

23641 ATGCAGTGAT ATAACGTTAC TTGTATCTTT GGATTTTAGT TTTATATGAA TTGGCGATTC

23701 AATAACAATA TCGTTATAAC CGCCGTCGGG TTGCTTAATA ATAAACTCGC TCACCAGAGG 

23761 AATATCATAG CCTTCAATAT CAACTTTTAC TTGATTAAAA TCATATACCA TAGGGTCAGA 

23821 TTCGTGTGAA GGTTTAGATG CCACATGGTC TTCAGCATTT AACTCCACTA GAATATCAGA 

23881 GCCATTTTTT AATAAAAAAC TAATGTTTTT ATCTTGGATC TGTTCGATCA TAGATGAAGC
23941 AAGTTTTATT ATCTGTGGCT GGTTGAACAT AAATACACCC ATGGATCCTC GCGAAGGAAC 

24001 AGTGCCGCAA TATTTCCCAT GTTATTAATG ATTGAAACAT CATTAGTAAA TGATTCACAT
24061 ATAGTATGCC ATACTCCTGT GTTATCTTTC CAATCTAATA CTATGTTAGT ATCAAGTTTG 
24121 AATTCAGCAT CATCTGATTC ATAATCATAA TTTATACCAA CTCCAATTTC TGATTTTCTA 
24181 GGAATTTTTT CCTTGGTTCT TAGATGCATT AACACTCTAA AATATTCGGC ATTTTTAAGA 
24241 TCGATGGAAA TAATAAAATC CAAAGTTCCA TAATGAAAAA CTTCTTC T T C TTTTCCAAGC 
24301 ATTTCATCAT GTCTATCATA ATCAAATAAA ATAACCGTTT CATCTTCTAC CATCGATAAC
24361 AGGTATTTAA CCTCATCATT ATATATATTG CCTTTTGAAA AATTAATTTC CATTGAAGGA 
24421 TTGAACGTTA AATTAATATG ACCATTT CCT GGTGATATAT ACGAGAGATC AAAAATATTT 
24481 CCGGTAAAAC TGGCTAATTT ATTTTTTGTG GTTATAGATT CCTTATATTC GGCCAAATAA
24541 TCTGTAGCAA ATTGATTGTT GACTTTGTAT TCTGTCCTGG TATCAAGTTC TGATAATGTG 
24601 CTrTTAACAA TGGCGTCTAA ATCATTTTCT GTGAGAATGG ATAATGTCAT ATCAGGGTTA 
24661 ATGGTCATCC CTTCTCTTGC AGGAAGACTA T7AAAAGAAT AATTGTCTTT TTTCTCATGG
24721 AAATAAACAA TAATGACGTC TTTTTCATAA TCAGAAGAAC AATACATACC AATGCTGGCT 
24781 TTTTTATTGA TCAGGTTTTC TATTTTATCA GTCACATTAA AATTAAACGG TGAGCTCCAG 
24841 CTGCCA TCAT AACGAATATG TGACAGTTTT AATATATAAT CAGTGATATC TATCTTGCCA 
24901 TCTTCACTTT CATTTTTCAG CTCTTTTTGT TCCAGCCACA GTAAATACAA ACGAGACTTG 
24961 TAAATAACAG GTCTGATATT TTCCTGCCAT ACATTGATGG GTATTTCAAT TTTTTTCCAT 
25021 TCTCCCCAGG CATTGGCAGC AAATTG AC CG TGCTGGCACT TTTGGTGATC GACATTGCGC 
25081 CAATAATATA TTCTGGGTTC TGTCTGGCTA TAACCAATTA AATAAGTGAG CCCCTCATTG 
25141 ACATTAATAC TGTCATGATA TCCGCTAATC ACCTGCAAGT TAG CGA CATC TTCAAATGCG 
25201 GTCAGATAAT TTTTAAAGCT ATCTTCAACG GTATCGATAT TTAACTGACT TTGGGAAAGT 
25261 TGCTGTAACA GGTTGTTCAT CATACCTGTC TGACCAATAC 
25321 TTTTCCGGAT AATAGGCCAG TTCAGATACG CCGGCCCAGG 
25381 GTTTCCCAGT CGCAGAAGAA CTGACGGGTT TTCACTGGCT TTGATACTTT TCCTTCAACA 
25441 TTATTCAACG CCCGGTTGAC ATATAACTGA ATGCTGGCAA TGGCTTCTGC CACACGGGTG 
25501 GTTTTCACTT GGGCAGAAAC TTGGTTATCA ATCAGCAGAT AGCTGTACAA CTCATCCCGG 
25561 CTCTTAATCT GTTGAGGTGC ACCATTTTTG ATGTAGTAAG CACTGGCCGC TGTCGTCGTG 
25621 GCTTCATCCA GCCATGCCTG AAGCTGGTCG GATTGTTGAC TGTTCAGTCC CGCCTGCAAC 
25681 AAAGTACTGG CGGCTTGCCA ATCATCAAAT GTTGGCATCG GGGTTTCCGG TTCACCGACA 
25741 TATTTTAATT TTATGAGTGC AGCAACACCA TCCGGGGTAA TACCCAATGT AGCAGCGACA 
25801 TCCAGCCATT GCAGAGTGAC ATCTATAAGT TCTCCAGTTG GTAAAGGTAT TCACTCCCAA 
25861 ACCGGTCTGT TGCAATGCTT GTGTCACAAC CTGAGCATCA AAATTTTAAC GCCACCGCCA 
25921 AATTGTTCGG CAGTCAACGC TCCTAAGTTC CAAATGCTGT TAAGATTCTG TCGCGTAGCT 
25981 TCACAACGCA TGATCACAGC ATGGAAGCGG GTCAG CGCT T GCAAAGTGGG GAGATCATGT 
26041 TGCAGTGCTG TGGTTTCTGA TTGGAATTTC TCCGGTTTTG TCACCAACAG GGTCAGTTCG 
26101 TTTTCGCTGA GTCCAATATT GCGCACAATC AGAGAAAGTT GCCCCAGTAC CTGACAAAAA 
26161 GCCACCATGT TGCTGGTTTC ATTCTCTGAG CGATCACGGT TAGCCGCAAT AATCATGAAA 
26221 TCATCGAATG TCAGTCCTTG TGGTTTTATC TGATTAATCC ACAGCAAAAT AGTTTCTGCT 
26281 GTTTTGGCTG AATCCATTTG AATGCTGGCA GCAATCAGCG GGGCAGCTGC ACGGATCAGT
26341 TCGTCATCAC CGAGTGAAAG TGTTGATAAT CCATTACTTA GTGTCGTGAT AAGGTTTTCA 
26401 ATATCCGGCG TAAGGACAGT GCTGTAATTA TCCGTGGTCA TCAGAAACAC ATCACTGACA 
26461 GACCATTTCT GTGTTGTCAG CCACTGGGTG CATTGGAACA GAAAGCTGAT TAATTGCGTT 
26521 AATGCTGTAT CAGAAAAAAG GGCAATTTTC GTGTTCACAT AGGGAGAAAC CGACAACAAC
26581 ATGGATAATT CATTCACTGT CAGATGATGA ATGTCTGCCA GCAGACGAAC GCGATAAAGC

26641 AGAGACAGGT TCTCGATGGA ACACATAAAT TCTGGATTTG TTCCGCCATT AGCCAGTTTC 

26701 CATAATGTAT ACAGTTCAGT ATCATTCACT CTGAAAGCAC GTTTCATTAT TCCCAAATAA 

26761 AAATGGTTTT TTGATTCACC GGGGGTTAAA TCCAGTTTGG TATTATCAGC AGAAAACTCT 

26821 TGGCCATTTA ATAGCGGTGT ATTGAACAGC ATTGTAAAAT GACTGGGTTG TTGTTTAGTG 

26881 GAATATTGGC TGATATCTGA ATGACACAAT ACCAGCGCAT CGCTGACGCT AATATTATAG 

26941 TGCTGCATAT AA TATT GAAC ATAAAACAGC TTACCCAACA CATTGCTGTC AATGGTTAAG 

27001 TCATCATAAA TACTTTCTAT TACTTGCCAG ATATCTTCTG GAGATATGCC TGTCGCTTTA 

27061 TACAAACGAA TCGCTTTATT CAGCTTTAAC AGGAATATAT CACCGGGAAC TCCATCATTT 

27121 TAAAGTGTGC ATTGGCATTG ATAGCATCCG ACGGATTTGG TTAACTCGCC ATAAGCGGAG 

27181 TGTTATACCG TTGGT GATTT GCTCTGTCGT CAATTTAATG GGAATACTGT AATGGGTATT 

27241 AGCAATGGGG ACGAAATTTT TA TCTTGGTA TATATATTCT TTATCTCCAT TCTGGAGACG 

27301 AAAATCCAAG TGGTCAGGTT CTGTTTTTTT TACACTGAAA TTATATTTGT ATTCATTTTC 

27361 TTTGATTGGA ATTAGCTCTG CATAGTTTAA ATGTGAATCG TAGAAATCTT TGCGGGTTCG 

27421 CTTAATCAAT CTTGCCGTTG CCGTATCATT CCCGTCATTG ACCAATGTTA TCAGTTGCTC 

27481 ATTCTTATAC TGTTGATTTG TATTTTTCTT ACCGAAGGAG AGATTGACAA ATAAACTGAG 

27541 TTCATCATAA GACAAATCGT AGTAGCGAGC CAAAGAAGCA TAACTCTTAA AAATCAGTAC 

27601 ATCATCTGTA CCGAAATTTT TCTTCATCAG TTCTGTTGAA TTTTCCGGTG TAATTTCTTC 

27661 TACAAGGATT TGATACAATT CAGGCGATAT ATCAGTCTTA ATAGCCAGTA GCGATGTTGG 

27721 GTCCATTAAT TCCGCTACGT CTGTATTACG GCTAAATGCG GTGAGGTTTT TATCTTGCAA 

27781 TAAAATTGCC TGACGGGCTG ACTCATACGG CAGATGATAG GGTGTCATGC CGGTTTGCCG 

27841 GTAAGTGGAC AACATTTTCA TTACACCGTT ATAGTCAGTT TTCTCTAACG TCTGAATATT 

27901 ATGCAGCAGT AATTCATTAG ATAAGGATAA TGTGGAAATT TCTTCATCCA TATTATTCTG 

27961 TGTCAGTGCC AGTGAAGCAA TGTCGGGGCG TCGTTTATTC AGGTGATATT GAGAATTGTC 

28021 AGGATGAAAA TCTTTCGCTT CCCGATATAA TTCTGTTAAA TAAGCCGCTG GTGAAAATAT 

28081 GGAAGCAATT GATCCCGGTT TTACAAAACG GTGGG CGCGG CCATAAAACC AACTGTTGTA 

28141 ACTATTGTTT AGGGTTGACG GTGTAATATT AAGGTTAGTG ATATTAGCCA GTTGTGGATT 

28201 AGCACGGGAC AAAATGCGCA GTTCTTCAAG TTTATTCTGT TTTGATTCCT GATGAGCCTG 

28261 TTGATATAAA AAGTCTGTTT CTCGCCACGT CAGAGTTCCA CTTGTCCTAT GACGAAATTC 

28321 GCTGAAAGAC ATAAACGAAA TGTTTGTCAA TAATAAAGTA TCACCAGCCT TTTTCTATTT 

28381 ATCTTATCTA ACAGTTCATT AACTTTTATC ATATAAATCC TTAAGTTATT GTCAATTTAA

28441 TGATTAATGG TTTTTAGGTG GAGATTATTA TAATCTGATA GGAATATTAT GGTTAATTAA 

28501 ATTGATACTG ATTTATCGCT CTATTCTTTC AATAAAAAAT AAAGAACTTC CCTATAATAC

28561 ATGGATTTAA ATAATGAATA CCGTATGTTA AAAATTAAAT TTTAACAAAC TTTCATGAAA

28621 AAATTCAACT CAACAATTGT TTAAATATTT TTAATTGTGT TTGTGCTGTT TGAAAAATGA 

28681 ATGACTAATA TTTATCTATG AAAGATTATT TATTGAGGAT GTCTTGCTTG GTTTCAGGGG 

28741 GCTACGTTGG AGTCAGATAA ATGTGTGCAA AAAGAAATCC TTAATAAAGT TGCGTAATTA 

28801 CAAAAGTTGG TATATCGTGA CAAGAGTGAT AGTAATGTCA CATAATTTAT TGAATACCCG 

28861 AACCTCGCAA ATGCGGGGTT TTTCTTCGCA TAATCAAAGA GAAAGCTATG AAAAAAACAC 

28921 TGATTACTCT TATTCTCAGT ACCCTTTCTT TTGGTGCTTT GGCACAGCAG GGTGG CTTCG 

28981 TTTCCCCGGA CAGCACAGAC TATACTCAGG GTGGATTTAA AGGTCCAACT CCCAACCTGA 

29041 CCAGCGTTGC TCAAGCAAAA TCTTTTCGTG ATGATGCGTG GGTTGTTCTG GAAGGAAACA 

29101 TTGTTAAACA GGTTGGTCAC GAACTCTATG AATTCGCGGC CGCATAATAC GACTCACTAT 

29161 AGGGATCGCT TATTACGGAC TTATCCGGAA AGCTATCTGG AACCCCTGTT ACGCCTGAAT 

29221 AAAACAGAAT TCAGGGATAA CAGTGGTTCT GTTTATGTTG ACATTGATGA TAAGCGCTGG 

29281 ATGGGTCTGA CGGCCACTCC AACTGACAAA GTTCGTATCG AAGGTGAAGT GGACAAAGAC 

29341 TGGAACAGTG TTGAAATTGA TGTCAAAACT ATCCGCATAG TGAAATAACT CAAGCACTTT 

29401 GAATATAGCC CCGCACTCGC GGGGTTTTTT GCTTTCTGGG AGTCGGAAGT TTAACCGTAG 

29461 TGACGAGGAT CAAAACTAAG TTAACGGCAG TGGTCACTGA TTTGGTGCAT AAGTTATCAA 

29521 AAGTTAAAAA TCAAAACTTA TTTTTTATTT AATAGAGGAA TGTCACCCTG TAGGTGAATA 

29581 ACGTTGACGG ATGTAAATAT ACAGTATTAT AGTCCTTTGA TATGTTATTA AATTGAAAAA 

29641 CCTTTAAACT ATATTCGGGG GAAATTATTA TGTCAGATGT TCGTAATATT ATTAATGTTG 

29701 ATAACAATTT TGGTTGTGAA TATAAAGCGG ATTTATTTAA ATAAGTTTTC ATAATTGTGA 

29761 TACACCCATT TTTCTCATCC CCGGTTTTTG CTGTTGTAAG GAAGCGGTTT CCATGAAGAT 

29821 TTTGACATGG TTAAGCAACT GCCACATAAA TTGGCAGCAG TGGTTTCGTG TCACGGTTTC 

29881 ATGCAAGGAT TGCCATAGAC GTTCAATTTT ATTCAACCAC GGGCAATAGG TCGGTAAAAA
29941 GAGAAGATTA AATTTGGGAT TCTTTGCCAG CCAAACCCTG ACCTTCCGGC TCTTATGAAT 
30001 GCAATAGTTA TCTAAAATTA ACGTGATGGT TTTGGCATTA ACATATTGAT TGTTAATTTC 
30061 ATCTAACAAT TTGATAAATA AATCTGAGTT CTTTC TCAAG CTACCGACAT AAGTGATTTC 
30121 TTTCGTTTTC GCGTTGAGGC AATTGGCAAG GTAGTGTTTT TGGTTCTTTC CGGGGGTAAC 
30181 AACACGCTTT TGTTGCCCTT TGAAGCACCA GTCTGCACCG ATTTTCGGGT TCAGGTTGAT 
30241 GTCCACCTCA TCCTCATAGA AGACCGGGTG TTTCTCTTGA GGCATTGGAT AACGTCTCGC 
30301 TGATTTTTGC CATTTTTTCA TCATACTCAG GGTCAGGCAA TTTTACGGTT GGTGCCGCCC 

30361 TTC3CCAAAC GATGCCCGTC CGGCAAAAGT AGCGATAGAG GGTACTTTGA GAGAGCGATG
30421 TATTCAGTAG CTCATTGATT TTAAGTGTAA TAAGCTCAAG GCTCCATCGT GAACGGAGAT

30481 AGCCAAAATG TTGTGGCGAG TGCTGTAATA AGAAAGAAAT GACTGTGAAG AGCGGAGCTA

30541 AGTTCCAGAT GGCAGGCCTT CCCGCCGGGA GGCTTTTAAG TCCTTCCAAC CCGTATAATG 

30601 TTAACCAATT TACCCAACGA TGAACGGAAG AACGTGAACA GTGAAGCGTT CTGGAAACGT 

30661 GAGAAACCGT ACTCCCTTCA TGTAACATCA AGAGCGCGGT GAAGCGACGT GCATAGTCCT 

30721 TATCCCGGGT TTTCTGGATA GCTTTTTTCA TCGGACGTCG TTCATTTCGG GGTATTGATG 

30781 TTATGATTGG CATGACTCAG TCCATTTTGG GATTTGTTTT GATTTGGCGA TTAATCAGAT 

30841 CGCGAAAATC GGACTGAGTT CCCTTCAAGT GATCTACTAT TTTGAAATCT TATTTAATCA 

30901 GGAGTCAGCA AATGAGTTAT TCCCCATAAT ACCTGACCAT GTGGTTGTTT ATCCGGGAAA 

30961 TGATTCATCT ACCGGTGGTA TGTGGATTCC TTGGTGCGAT AGTCAGAAAG ATATTGACTC 

31021 TGGCCATTAT ATCAAAGTTA CTTTCAGTAA AAAGGACGCT GCTGATATTG TGAACTACAT 

31081 GTTTCAACAT GGCAGTTATG TTTATTTTAC AGACAGTAGT AAACAATTTA GCAATAAGCA 

31141 AATTATGTCT GGTGATTCAG CTAAAGGCAA AGGGGATTAT AAGCTTGAAA TTAAAACAAA 

31201 CGGGAACCTT CCACTGATGG TATTGAATAA ATATTGATTC ATTATTATTT ATGGATAAGA 

31261 AATTAAGTTT ATATTTCATC TGGTTTCTGC AAT TAAGT TT TAAAAATTAA TTCTACTTTT 

31321 TTTATGGTTT TATATTTAAT GCCAATCATA TTATTTTTCT TATAATAATT GATAGTTTAT 

31381 TTATATAGTA AATAAATTCT GTTGGATGTG ATTATTATTG TGAGACGGTA ATAATTAACA 

31441 TAACAGAAAA TTCATGGTTA GGAAATTCAA TCAACTTTTG TCCGGTTTCC TGACCATGAA 

31501 GAGCTGTATT TACTGTAGAA CTCGCATTGA TACTGGATTG ATTAG CCGGA CGAGTGTTGG 

31561 GTCAGCAGAT AATATGTTGT ATATTGGCTG TGGATTTTTC AGCGAGATGA TAGCTTTGGC 

31621 AGTAAAGGCG ATTAATAACC GATAAAACAG AGAGACGGAT TGTGGCCAGG AAAGCAAAAA 

31681 AGCCTCACCA TGACGCGTTA TTCAAACATT TTTTAACCCA ACCAGAAACC GCCCGGGAAT 

31741 TTTTATCCCT TTATCTGCCG GAAGCGATCC GGTCAGTGTG TGATTTACCA CACTAAAACT 

31801 GGAACCGGCA GCTTTGTGGA CAGGCAATTA CGTCAGTTGC ACAGTGATGT GCTGTATTCT 

31861 GTCGAGACAA CCCACGGGGA CGGTTACATT TATTGCCTGA TTGAACACCA GTCCACGCCT 

31921 GATCCGTTAA TGGCCTGGCG GC TGAT GTAT TATTCGCTGT CAGCCATGGC TGCGCATCTG 

31981 AAAAAAGGAC ATACTGAACT CCCTTTGGTC GTCCCCCTGC TGTTTTATCA TGGTGAGGTG 

32041 AGGCCTTACC CTTACTCAAA TCGATGGCTG GATTGTTTTA CACTCTCTGA ACACGCGGCT 

32101 CACCTGTATA ATCAGCCCCT GCCGTTGGTG GATATCAGTG CGCTCAGTGA TGAAGAGATC 

32161 CTGACACATA AAAGCATTGC CTTGATGGAG CTGGTACAAA AACATATCCG TTGCCGGGAT 

32221 ATGCTGGAGT GGGTTCCCCA ATTGGTGGCG TTGTTGAATG CCGGTTATAA TAGCGCCGAA 

32281 CAGCGCCATG TTGTGTTAAG CTATATTTTA CTGAATGGAC ATACGCTGGA TCTCGCCCAG 

32341 TTTGTCCATC AACTGACTGA ACAATCTCCG GAGCATGAAA CCATGTTGAT GACTATTGCA 

32401 GAACAGCTTG AACAAAAAGG GCGTGAGCAA GGCCGGACAG AAGGCAGAAC AGAAGGCAGA 

32461 GCTGAAGGAC GGG AAGAAGG CAAGCTGGAA ACGGCGCGCG CATTATTACG GCATGGTGTC 

32521 AGTCTGGACA TCATTGTCAC CAGTACCGGC CTGAGCCGGG AGAAAATTGA AGCGTTAAAG 

32581 CATTAAATGG ATACGCTTTT TCACAGCAGG ATA7GGTGAC CCCTGTGAGG CCACCGGAAA 

32641 ATTTTATTTA CTACGATTTA CGACGGGTTA CT7TAGGAAG CTGAATGAGA CGTCCTTTGT 

32701 TATATAACGG TCCCATATCA ATCTTCTCTT TTCCGCGTAC AGGTAAGTAA CCCAAACCTT 

32761 CGTGAGCAGC ATTTGCCAAC AGGCCATCAT CCTGATCGCC TGACCAAGAG AAGATCCCGC 

32821 CCAATTTCAT TTTGGTTGCA TAAATTCCCT TATGCAGCAC AGTGCGGGGC GTATCCAGTG 

32881 AAATCCAGTG ACCACCGTCA GCATTAAAGA GTGCGTCAGC GTCGGTTTCC GTGTCTGTCA 

32941 CCAGTTCAAA CTGATTTTTC CCGCGTGCAA TTTCATATTC CG CAT CG TAT TGGTTATTCA 

33001 GCAGACAGAA GAATTCCGGA GCACCTTTTT CCATCGTGCC CAGTGGCTCT CCTGTTCTGT 

33061 TATAGCGGCG CGTTGTCAGA TCAGCACCCA GACATGAACG TCCATAGTTA GCAAATCCGA 

33121 GGTGAATTTT CTCCGGTTGT ACACCTTGTG ACAGTAAAAA GCGGATCGCC TCATCTGCCG 

33181 AGTAATCCAT GTCCCGATCA GGATTGGGCG GAGGAGGGTT ATCGCCGTCA TATTCATATC 

33241 TGGGGGGATA CAGGTTAGTA TGGTGACCGA TGTATTCTGC CCAACCGGTA CCAAAGAAGT 

33301 CGTAGGTCAT CACAAAGATA TTGTCTAAAT AAGGTGCGAT TTCTTTGAAG CTGGACTTCT 

33361 CCATTTTGGC AACGACGGCG CTACAGGCTA TCGTGATTTC TTTACGGGCC CGGGTTCCAA 

33421 AGGCGATGTT CAGTGCTTCA CGCAGCTCTT TCACTAACAA AACATAGTTT GGGCCATCAT 

33481 GTTCCGGGTC GAATTCATTA CCTTCTTCAC CTGTGGCGCC GGGGTATTCC CAGTCGATAT 

33541 CCACCGCAGT AAACATGGGA AAACGCCGGG AAGAAGTCGA CGATGCTACT CACAAATGTA 

33601 GCACGTTGCT CAGGATCTTT GGCCATCACA GAGAAATACC CTGACATACT CCAGCCGCCG 

33661 ATACTGAATG CGAGTTCCAG CTTATGCCCT GCCTGTTTTG CTCGCGCTTT CAGATTACGC 

33721 AATCCCCCCA GTAAACCGGA GGCTGCATCC TGATTGTAAT ATTGCAAGAA ATTCTTCGGG 

33781 CTGGCATCAC GGCGCTGATC CGCGTCCAGA CCGACATTGC GTGTGGTGCC TAAATCACCA 

33841 TAAGGATCAA CGGGTACAAT ATGGCCTAAT GTAATAGGGG CAATCTGGCC ACTGCTGGCT 

33901 TCTGCTTGCC GGTTCCACCC GTCAACAACC TCATTAATCC GTTCGGATAA CTTGCCTTTG 

33961 TCACCGTTGA CGG CCATAAA ACTGAAAATC AGGCGGTCGT AGGCGGTAGG CGGGATTTTT 

34021 TCCAGATCAA AACCACGGCC GGGGGCATCG TCGCTGGTCA GCGCAGTGTT ATCCTGGGTT 

34081 TCTGGCGACA AACGCGCATC ATACTGGCAC CAGTCAGTAA TATAGGCAGA GACTTTAGGC

34141 AGCGGTTCTG TATTTTCCGG ATCAACTTCA TATTCGTTGT ACAGGGACTT GGCAACACGT 

34201 GCTGAAGAAT AACTCAAAGG AGTTCCGCTG CCGTCAGGTT TATATCCCAC CTTCTGATAG
GTTTCTTCTG
GGCGTATTGG
CAGGCATTCT
GCTTTAATTG
TCAAAATTGA
CGCCGATCAG
TGAGTGCATC
GGTTACCGTG
CATAAACCGG
CGATGTTTTT
TATCATATGT
GTGTATTCCA
AACCGAAGGC
CAAACGGAAC
CCATCTGGGC
CGTATCAAAT
ATATTGCAAT
ATCGGCAATT 
TAAATCAGGT 
AAAAACCGCG 
AACCAGAGAT 
ATAAACATTT 
ATATCAGTTA 
GCTGAACTCA 
ATCGCTCAGC 
AATGGCGCCT 
GCCGCTGTCT 
AACGTTTTCG 
AGCAGTACTG 
CCCCAGCAAT 
ATAATCGCGC 
AATTTCCCAC 
ACGTCTGGTG 
TTCTCCGACC 
ATCATACTGA 
CGCATCATAG 
TCATTGGTAN
TTCGTCATAA
CAGAGCCAGA 
CATCCGGTAT 
TTGGCATAAG 
TAGGCTCAAC 
CTTCAACTTC 
TGGGGTCTAC 
ATAACGCATC 
ATGGCTGTGT 
CTTCATAACC 



ACCTCGGTTT 
TCTTCCGGTG
CTATCATAAA
GCGCCTACTT 
TGAGACATAA 
GAATTGTCTT 
TTACTTCCGC 
GACTCCAGCA 
GTCCCTTCAA 
TC CATG GCCG 
GGTTTGGTGT 
TAAAGCGTAT 
GCGGGCAATG 
AAGAACATTT 
TGTGATGATG 
TTATGTTGCA 
AGCAGCCCCT 
CCTGTTTCGT 
GCTTGCAGGT 
ANTTGTTCNN
38101 AAAATTATTC AGATAACCGA GCACCGCTTG TTGTACAGAA TCTTCGGTAA TTTTTCCCTG
38161 ATTAAGGGCA CTTTCCAGTT GGAAGAAGAA TTCTGTTTTA TTCAGGCGTA ACAGGGGTTC 
38221 CAGATAGCTT TCCGGATAAG TCCGTAATAA GCGATCCC 



N=unspecified base 



Fig.3. 



[Restriction maps. Site labels: HindIII, BamHI, SacI and SalI. Other labels: "200 bp deletion" and "2 kb deletion". Scale bar: 10 kb.]






INTERNATIONAL SEARCH REPORT 


International Application No

PCT/GB 97/02284

A. CLASSIFICATION OF SUBJECT MATTER

C12N1/20 C07K14/24 //(A01N63/02, 63:02, 63:00), (A01N63/00, 63:00)



According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED 

Minimum documentation searched (classification system followed by classification symbols) 

IPC 6 A01N C12N 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used)



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category *
Citation of document, with indication, where appropriate, of the relevant passages
Relevant to claim No.

X  WO 95 00647 A (COMMW SCIENT IND RES ORG; SMIGIELSKI ADAM JOSEPH (AU); AKHURST RAY) 5 January 1995
   cited in the application
   Relevant to claim No.: 1,5,11,13,18-21,24-26,29,30,32

Y  see page 1, line 3 - line 29; claims 10-13
   Relevant to claim No.: 3,4,6-10,12,14,27,28,31

-/--





Further documents are listed in the continuation of box C.

Patent family members are listed in annex.



* Special categories of cited documents:

"A" document defining the general state of the art which is not considered to be of particular relevance

"E" earlier document but published on or after the international filing date

"L" document which may throw doubts on priority claim(s) or which is cited to establish the publication date of another citation or other special reason (as specified)

"O" document referring to an oral disclosure, use, exhibition or other means

"P" document published prior to the international filing date but later than the priority date claimed

"T" later document published after the international filing date or priority date and not in conflict with the application but cited to understand the principle or theory underlying the invention

"X" document of particular relevance; the claimed invention cannot be considered novel or cannot be considered to involve an inventive step when the document is taken alone

"Y" document of particular relevance; the claimed invention cannot be considered to involve an inventive step when the document is combined with one or more other such documents, such combination being obvious to a person skilled in the art

"&" document member of the same patent family



Date of the actual completion of the international search

17 December 1997

Date of mailing of the international search report

14/01/1998

Name and mailing address of the ISA

European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk
Tel. (+31-70) 340-3040, Tx. 31 651 epo nl,
Fax: (+31-70) 340-3016


Authorized officer 

Muellners, VI 



Form PCT/ISA/210 (second sheet) (July 1992)



INTERNATIONAL SEARCH REPORT 



C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category *
Citation of document, with indication, where appropriate, of the relevant passages
Relevant to claim No.

Y  CHEMICAL ABSTRACTS, vol. 118, no. 1, 4 January 1993, Columbus, Ohio, US; abstract no. 3550,
   YAMANAKA, SATOSHI ET AL: "Biochemical and physiological characteristics of Xenorhabdus species, symbiotically associated with entomopathogenic nematodes including Steinernema kushidai and their pathogenicity against Spodoptera litura (Lepidoptera: Noctuidae)"
   XP002048914
   see abstract
   & ARCH. MICROBIOL. (1992), 158(6), 387-93 CODEN: AMICCW; ISSN: 0302-8933, 1992
   Relevant to claim No.: 3,6

Y  DATABASE DISSABS, STN-International / UMI Company, STN-AN 96:33246, DISSABS order no. AAI9608671, 1995
   DAVID JOSEPH BOWEN: "Characterization of a High Molecular Weight Insecticidal Protein Complex Produced by the Entomopathogenic Bacterium Photorhabdus luminescens (Nematodes, Biological Control)"
   XP002048915
   see abstract
   & DISSERTATION ABSTRACTS JOURNAL INTERNATIONAL, vol. 57, no. 1B, 1995, page 93
   Relevant to claim No.: 4,12,14


Y  EP 0 238 441 A (CIBA GEIGY AG) 23 September 1987
   see page 1 - page 2
   see page 4, paragraph 3 - page 5, paragraph 2; claims 10,12,22,36,37
   Relevant to claim No.: 7-10,27,28,31


X  WO 84 01775 A (COMMW SCIENT IND RES ORG; BIOTECH AUSTRALIA PTY LTD (AU)) 10 May 1984
   cited in the application
   see page 1 - page 3, line 10
   see page 4, line 24 - line 28
   see page 4, line 36 - page 5, line 3
   see page 14, line 17 - line 29
   see claims 26,27
   Relevant to claim No.: 1,4,5,11,13

-/--



Form PCT/ISA/210 (continuation of second sheet) (July 1992)



INTERNATIONAL SEARCH REPORT 



International Application No

PCT/GB 97/02284

C.(Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Category *
Citation of document, with indication, where appropriate, of the relevant passages
Relevant to claim No.



P,X

H. MATSUI ET AL.: "Nucleotide sequences of genes encoding 32 kDa and 70 kDa polypeptides in mba region of the virulence plasmid, pKDSC50, of Salmonella choleraesuis"
NUCLEIC ACIDS RESEARCH, vol. 18, no. 8, 1990, pages 2181-2, XP002050055
see the whole document

F. BINDER ET AL.: "Cyclodextrin-glycosyltransferase from Klebsiella pneumoniae M5a1: cloning, nucleotide sequence and expression"
GENE, vol. 47, 1986, pages 269-77, XP002050056
see page 269, the summary
see page 270, right-hand column, last paragraph - page 271, right-hand column, paragraph 1
see fig. 3 bp 2641-2809

US 5 616 318 A (DUDNEY RALPH A) 1 April 1997
see column 1, line 65 - column 2, line 52
see column 5, line 3 - line 4

WO 97 17432 A (WISCONSIN ALUMNI RES FOUND) 15 May 1997
see page 2, line 31 - page 3, line 23
see page 5, line 1 - line 16
see page 8, line 23 - line 33
see page 9, line 41 - page 11, line 14
see page 17, line 1 - line 21

Relevant to claim No.:

21-25

21-25

1,4-6,11,13

1-32



Form PCT/ISA/210 (continuation of second sheet) (July 1992)



INTERNATIONAL SEARCH REPORT

International Application No

PCT/GB 97/02284


Patent document            Publication    Patent family       Publication
cited in search report     date           member(s)           date

WO 9500647 A               05-01-95       AU 675335 B         30-01-97
                                          AU 6991694 A        17-01-95
                                          EP 0705340 A        10-04-96
                                          JP 9500264 T        14-01-97

EP 0238441 A               23-09-87       GB 2188049 A        23-09-87
                                          AU 608508 B         11-04-91
                                          AU 6999287 A        17-09-87
                                          BG 46006 A          15-09-89
                                          BG 46752 A          15-02-90
                                          BR 8701162 A        12-01-88
                                          DE 3788077 D        16-12-93
                                          DK 128687 A         16-09-87
                                          EG 18869 A          28-02-94
                                          ES 2059404 T        16-11-94
                                          IE 59456 B          23-02-94
                                          JP 62224295 A       02-10-87

WO 8401775 A               10-05-84       AU 558287 B         22-01-87
                                          CA 1214130 A        18-11-86
                                          EP 0126092 A        28-11-84
                                          US 4672130 A        09-06-87

US 5616318 A               01-04-97       NONE

WO 9717432 A               15-05-97       AU 1050997 A        29-05-97
                                          CA 2209659 A        15-05-97
                                          EP 0797659 A        01-10-97


Form PCT/ISA/210 (patent family annex) (July 1992)






